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ABSTRACT 

OBJECTIVE:       Medicare's  Average  Adjusted  Per  Capita  Cost 
(AAPCC)   risk  adjustment  method  does  not  adequately  adjust 
for  favorable  selection   (a  high  proportion  of  less  costly 
enrollees)   experienced  by  Medicare  HMOs  because  the  AAPCC 
overpredicts  the  bottom  75th  percentile  of  users  by  39%  for 
the  Medicare  aged.     Therefore,  Medicare  HMO  enrollees  cost 
the  program  more  than  they  would  have  under  f ee-f or-service 
coverage.     The  Health  Care  Financing  Administration  (HCFA) 
recently  funded  several  projects  to  develop  AAPCC 
alternatives .     This  study  develops  two  specific  condition 
risk  adjustment  models   (SPEC  I  &  II)   and  compares  results  to 
three  other  models;  an  AAPCC  proxy  and  two  models 
(ADG/HOSDOM  and  ADG/MDC)    derived  from  Ambulatory  Care  Groups 
(ACGs)   and  Payment  Amount  for  Capitated  Systems   (PACS)  . 

DATA  SOURCES:     Medicare's     5%     sample  enrollment  and  claims 
data  from  1991  and  1992. 

DATA  EXTRACTION  PROCESS:         Data  extracts  and  analytic  files 
developed  from  these  data  are  used  to  define  split  random 
samples,   dependent  and  independent  variables  for  all  models. 
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STUDY  DESIGN  AND  METHODS:         Split  random  samples 
(development  N=618,418  validation  N=618,772)   are  employed  to 
develop  and  test  an  ordinary  least  squares  linear  regression 
model  using  conditions  diagnosed  in  1991  to  predict  1992 
Medicare  expenditures  controlling  for  gender,  age, 
disability  and  Medicaid  status.  Specific  conditions  are 
defined  using  diagnoses  code  ranges  specified  in  the  1991 
ICD-9-CM  coding  manual  "List  of  Three-Digit  Categories"  . 
Samples  include  persons  age  65  and  over  continuously 
enrolled  in  Part  A  and  B  for  both  study  years  and  alive  at 
year  end  1991.  Beneficiaries  with  ESRD,  under  age  65  with 
disabilities  or  in  an  HMO  are  excluded. 

PRINCIPAL  FINDINGS:   Specific  condition  models  rate  high  on 
four  criteria  for  effective  risk  adjustment  models: 
predictive  accuracy,   administrative  feasibility,   ease  of 
auditing  for  fraud  or  gaming  and  incentives  for  cost 
effective  health  care.     Specific  condition  model  individual 
(SPEC  I  R2  =  0.073,   SPEC  II  R2  =  0.075)    and  group  predictive 
accuracy   (predictive  ratio  PR=1.03  for  both  models),   in  the 
aggregate,   is  comparable  to  two  more  complex  models, 
ADG/HOSDOM      (IBDf  R2=0.055  PR=1.02)    and  ADG/MDC  (32Df 
R2=0.063  PR=1.02).     Aggregate  group  predictive  accuracy 
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(PR=1.01)   for  the  AAPCC  proxy  is  comparabre  to  the  four 
comparison  models  but  performance  is  lower  for  group 
accuracy  across  systematically  biased  samples.     When  - 
accuracy  for  groups  biased  with  respect  to  characteristics 
of  the  enrolled  population  in  1991   (high  and  low 
expenditures,   hospital  utilization  and  presence  of  four 
chronic  diseases)   is  compared  across  all  five  models, 
specific  condition  models  rank  first   (SPEC  II)   and  third 

(SPEC  I) ,  ADG/MDC  ranks  second,  ADG/HOSDOM  ranks  fourth  and 
the  AAPCC  ranks  last .     Ranking  is  based  on  consistancy  of 
accuracy  across  all  biased  groups.     Specific  condition 
models  are  comparable  to  comparison  models  with  respect  to 
remaining  criteria  for  effective  risk  adjusters: 
administrative  feasibility,   incentives  for  cost  effective 
health  care  and  ability  to  audit  for  fraud  or  gaming. 
CONCLUSIONS:       Specific  condition  risk  adjustment  models  are 
viable  alternatives  to  Medicare's  AAPCC. 
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CHAPTER  I  INTRODUCTION 

Rising  health  care  costs  and  increases  in  the  number  of 
uninsured  and  underinsured  sparked  health  reform  debates  at  the 
federal  and  state  levels  in  the  early  1990s     (Blendon,  1992)  that 
persist  in  1997.    The  goal  of  recent  health  reform  initiatives  is 
to  control  rising  costs  while  preserving  access  to  and  quality  of 
health  care. 

Private  and  public  payers  (e.g.,  employers,  Medicare  and 
Medicaid)  perceive  expanded  enrollment  in  prepaid  managed  care 
organizations  (MCOs)  as  one  solution  to  current  health  care  cost 
containment  and  access  problems  (Luft,  1986;  Miller,  1994) .  There 
is  evidence  that  MCOs  have  reduced  costs  primarily  by  controlling 
inpatient  utilization  (Brown,  1993;  Bradbury,  1991;  Miller,  1994; 
Stern,  1989)  .     However,  in  our  health  care  system  where  there  is 
choice  among  multiple  insurance  plans,  without  effective  risk 
adjustment,  expanded  prepaid  MCO  enrollment  can  result  in  higher 
overall  costs  due  to  selection  bias   (Brown,  1993;  GAO,  1994; 
Hornbrook,   1991;  Luft,   1985  1986  1988  1995)  . 

"Selection  Bias"  exists  when  enrollees  in  one  plan  are  more 
or  less  sick  than  the  average  person  in  the  market  used  to 
estimate  rates  required  to  cover  the  cost  of  care  for  those 
enrollees.     "Favorable  selection"  occurs  when  plan  enrollees  are 
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healthier  than  the  norm  for  that  market  for  whom  a  lower  than 
average  rate  would  suffice.     "Adverse  selection"  exists  when  plan 
enrollees  are  sicker  than  the  norm  and  require  a  rate  that  is 
higher  than  the  average.    (Hellinger,  1995;  Lubitz,  1987;  Luft, 
1985  1986  1988  1995;  Wrightson  1990)  .     This  definition  is  a 
composite  of  several  similar  definitions  in  the  literature 
reviewed  in  Chapter  II  (Anderson  GF,   1986;  Anderson  RV,  1991; 
Hornbrook  1991b;  Luft  1986)  . 

Recent  evidence  indicates  there  is  favorable  selection  among 
Medicare  HMOs  and  among  HMOs  and  PPOs  in  the  private  sector 
(Hellinger,  1995)  .     Given  favorable  selection  for  prepaid  MCOs  and 
adverse  selection  for  participating  indemnity  plans,  overall 
health  care  costs  will  increase  (Brown,  1993;  GAO,  1994)  .  This 
problem  is  exacerbated  when  prepaid  rates  are  based  on  the  average 
costs  for  the  FFS  segment  of  the  market,  as  they  are  in  Medicare, 
because  FFS  costs  are  higher  than  the  overall  average  due  to 
adverse  selection  (Brown,   1993;  Luft,   1986  1988) . 

There  are  several  ways  to  reduce  selection  bias:  national 
health  insurance,  mandatory  reinsurance,  mandatory  community 
rating,  random  assignment  of  employees  to  health  plans,  and  extend 
enrollment  or  responsibility  periods  (Luft,  1995)  .  Allowing 
selection  bias  to  occur  and  adjusting  payments  to  health  plans 
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based  on  their  enrollee  risk  profile  relative  to  the  norm  for  the 
market,   is  probably  simpler  than  attempting  to  restructure  or 
control  competitive  market  forces   (Luft,   1988  1995)  .  Risk 
adjusting  payments  to  health  plans  based  on  relative  risk 
neutralizes  incentives  for  and  effects  of  selection  bias  by 
adjusting  payments  upward  for  plans  with  sicker  enrollees  and 
downward  for  plans  with  healthier  enrollees  (Lubitz,  1987;  Luft, 
1986  1995) . 

In  order  for  risk  adjustment  to  mitigate  effects  of  selection 
bias,  effective  risk  adjustment  models  must  be  available  for  each 
market   (e.g..  Medicare,  Medicaid,  large  employer  groups,  the  small 
group  and  individual  market) .     Risk  adjustment  models  are 
"effective"  if  they  are  accurate  predictors  of  cost  especially  for 
biased  subgroups  of  enrollees,  administratively  feasible,  provide 
incentives  for  cost  effective  health  care  practices  and  are  easy 
to  audit  but  not  easy  to  manipulate  (Anderson,  1990;  Ash,  1989; 
Lubitz,  1987;  Luft,  1988  1995).     Despite  extensive  risk  adjustment 
model  development  research  over  the  past  two  decades,  the  search 
continues  for  more  effective  risk  adjustment  models  (Newhouse, 
1994;   GAO,   1994) . 

Medicare  uses  an  Average  Annual  Per  Capita  Cost (AAPCC)  model 
for  risk  adjusting  payments  to  prepaid  MCOs .     Medicare  capitation 
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payments  are  calculated  using  95%  of  the  US  average  annual  cost 
for  FFS  enrollees  as  the  base  rate.     That  rate  is  adjusted  for 
geographical  differences  in  costs  by  county.     The  geographically 
adjusted  base  rate  is  then  risk  adjusted  for  the  expected  cost  of 
four  risk  factors  in  Medicare's  risk  model:  age,  gender.  Medicaid 
eligibility  and  nursing  home  residence.    Risk  factor  costs  are 
also  calibrated  using  FFS  enrollee  costs  (Anderson,  1990)  . 

Medicare's  AAPCC  does  not  adequately  adjust  for  favorable 
selection  in  prepaid  HMOs  and  Competitive  Medical  Plans 
(CMPs) (Brown  1993,  GAO  1994) .     The  AAPCC  overpredicts  costs  for 
low  utilizers  and  underpredicts  high  utilizers.  Consequently, 
Medicare's  prepaid  MCO  enrollees  are  estimated  to  cost  the  program 
5.7%  more  than  they  would  have  had  they  remained  under  fee-for- 
service  (Brown,   1993)  . 

Numerous  studies  conducted  through  the  1980s  demonstrate 
various  AAPCC  alternatives  have  greater  predictive  accuracy 
(Anderson,   1990;  Ash,   1989;  Ellis  and  Ash,   1995;  Ellis,  1996; 
Epstein  and  Cumella,  1988;  Gruenberg,  1996;  Weiner,  1996)  .     The  US 
General  Accounting  Office  (GAO)  issued  a  report  in  September  of 
1994  calling  for  development  of  better  risk  adjusters  for  Medicare 
(GAO,  1994)  .     Four  "families"  of  adjusters  are  identified  as  the 
most  promising  AAPCC  alternatives:  Self -report  measures,  simple 
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clinical  measures  (e.g.  history  of  heart  disease),  a  more  complex 
clinical  measure  such  as  the  Ambulatory  Care  Group  (ACG)  method 
and  a  system  that  combines  clinical  and  utilization  data  such  as 
the  Diagnostic  Cost  Group  (DCG)  method  (GAO  1994)  .     The  Health 
Care  Financing  Authority  (HCFA) ,  which  has  federal  responsibility 
for  Medicare  and  Medicaid  programs,  funded  a  number  of  research 
projects  to  address  the  risk  adjustment  problem  for  Medicare 
including  a  dissertation  grant  for  this  research  (HCFA,  1995; 
Ingber,   1996) . 

Development  of  effective  risk  adjustment  models  is  an 
integral  part  of  health  reform  proposals  at  all  levels  of 
government  (AAOA,   199'4;  Dunn,   1995;  GAO,    1994  1994b;  Kronick, 
1995;  Luft,  1995;  Newhouse,  1994;  Shewry,   1996)  .     Effective  risk 
adjustment  is  necessary  in  order  to  minimize  incentives  for  risk 
segmentation  inherent  in  methods  for  setting  capitation  rates. 

This  research  develops  and  validates  the  effectiveness  of 
specific  condition  risk  adjustment  models  to  support  Medicare 
program  reform.     This  objective  is  particularly  timely  and 
relevant.      Expanded  enrollment  in  prepaid  managed  care 
organizations  is  being  tested  in  the  "Choice"  Demonstration 
Project  initiated  by  Medicare  in  1996  (HCFA  CHOICE  RFP,   1995) . 
Two  aspects  of  the  "Choice"  demonstration  promote  expanded 
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enrollment  in  MCOs.     First,  demonstration  sites  are  selected  based 
on  low  MCO  enrollment  for  that  geographic  area   (e.g.  Atlanta  GA, 
Columbus  OH,  Houston  TX) .     Second,  health  plan  qualification 
criteria  are  more  liberal  than  criteria  for  TEFRA  Risk  Plans  and 
Competitive  Medical  Plans   (CMPs)  making  competing  for  Medicare 
demonstration  site  awards  more  attractive  to  MCOs  (HCFA  CHOICE 
RFP,  1995) . 

Risk  adjusted  prospective  payments  to  health  plans  are 
integral  aspect  of  the  "Choice"  Demonstration  Project.  MCOs 
applying  for  demonstration  site  awards  had  to  submit  an  Adjusted 
Community  Rate   (ACR)  proposal  consisting  of  three  parts:  payment 
methodology,  risk  adjustment  methodology  and  outlier  payment 
arrangement  with  HCFA  (HCFA  CHOICE  RFP,   1995) .     In  the  "Choice" 
demonstration  RFP,  the  HCFA  indicated  two  alternatives  to  the 
AAPCC  would  be  available  to  participating  health  plans;  ACG/PACs 
derivative  models  (Weiner,  1996)  used  here  as  criterion  measures 
and  a  DCG  derivative  developed  concurrently  at  Health  Economics 
Research  Incorporated  in  Boston  (Ellis,  1996) . 

The  goal  of  this  research  is  to  examine  whether  simple 
Specific  Condition  risk  adjustment  models  are  viable  alternatives 
to  Medicare's  AAPCC  for  the  Medicare  aged  population.     This  goal 
is  achieved  by  developing  two  Specific  Condition  models  and 
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comparing  them  with  three  other  models .     The  three  comparison 
models  are  an  AAPCC  proxy  model  and  two  more  complex  clinical 
measures  intended  for  use  in  the  "Choice"  Demonstration  Project: 
the  ADG/HOSDOM  and  ADG/MDC  models   (Weiner,  1996) .     The  ADG/HOSDOM 
and  ADG/MDC  models  are  derived  specifically  for  the  Medicare  aged 
population  from  Ambulatory  Care  Group  (ACG)    (Starfield,  1991; 
Weiner,  1991  1996)  and  Payment  Amount  for  Capitated  Systems  (PACS) 
(Anderson,  1990)   risk  adjustment  models  developed  at  Johns  Hopkins 
University. 

Four  evaluation  criteria  are  employed  in  comparing  the  five 
models;  l)predictive  accuracy,  2 ) administrative  feasibility, 
3) incentives  created  by  the  model  to  achieve  desired  objectives, 
and  4)  ease  of  manipulating  and  auditing  the  system  (Anderson, 
1990;  Ash,   1989;  Dunn,   1995;  Fowles,  1994;  Hornbrook  1991a; 
Weiner,  1996)  .     To  be  considered  viable  alternatives  to  the  AAPCC 
in  this  research,  the  two  Specific  Condition  models  must 
outperform  the  AAPCC  and  be  at  least  comparable  to  the  ADG/HOSDOM 
and  ADG/MDC  models  for  all  four  evaluation  criteria.  The 
ADG/HOSDOM  and  ADG/MDC  models  serve  as  criterion  measures  to 
assess  validity  of  Specific  Condition  risk  adjustment  models  for 
the  Medicare  aged  population. 


It  is  not  intended  that  the  two  Specific  Condition  models 
presented  should  be  considered  for  implementation  as  AAPCC 
alternatives.      Additional  research  is  needed  to  develop  and  test 
a  simple  specific  condition  model  for  the  Medicare  aged  for  use 
with  both  claims  and  survey  data.    Automated  encounter  level  data 
is  not  available  from  all  health  plans.     Start-up  time  and  costs 
for  making  claims  data  available  are  a  high  administrative  burden 
for  health  plans   (Fowles,  1994)  .     Recent  research  indicates  risk 
adjustment  models  comprised  of  a  list  of  self -reported  conditions 
have  predictive  accuracy  comparable  to  more  complex  claims-based 
models  (Gruenberg,  1996;  Fowles,  1994  1996)  and  should  be 
considered  where  claims  are  not  available  (Fowles,  1994  1996)  . 

One  model  with  a  limited  list  of  conditions  (e.g.  15  to  3  0 
conditions)  measured  by  diagnosis  codes  present  in  claims  or 
survey  questions  regarding  presence  of  the  condition  could  be 
applied  to  risk  adjust  prospective  payments  for  participating 
Medicare  MCOs .  For  example,  risk  adjustment  amounts  could  be 
calibrated  from  claims  and  survey  data  collected  via  the  MCBS 
and/or  the  Medicare  Registry  and  applied  to  rates  proposed  by  MCOs 
depending  on  the  proportion  of  their  enrollee  populations  with 
that  list  of  conditions  relative  to  the  norm  for  the  Medicare 
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population.  Additional  research  is  required  to  develop  and  test 
such  a  model  using  both  claims  and  survey  data. 

The  remaining  six  chapters  present  an  in  depth  review 
of  risk  adjustment  in  the  context  of  the  health  insurance 
market,  the  state  of  the  art  in  risk  adjustment  model 
development,   research  methods  employed  in  developing 
specific  condition  models,   findings  regarding  the 
effectiveness  of  those  models  relative  to  three  comparison 
models   (AAPCC,  ADG/HOSDOM,  ADG/MDC)   and  a  discussion  of  the 
implications  and  limitations  of  those  findings.     The  thesis 
concludes  with  a  chapter  that  summarizes  findings  and 
discusses  opportunities  for  future  research. 

Chapter  II,   the  first  part  of  the  literature  review, 
addresses  risk  adjustment  in  the  context  of  the  health 
insurance  market  and  health  reform.   Issues  reviewed  include 
the  evolution  and  dynamics  of  the  health  insurance  market, 
managed  care  organizations (types  and  market  share),  risk 
adjustment  related  to  prepaid  rate  setting  methods,  biased 
selection  and  the  applicability  of  risk  adjustment  methods 
in  recent  health  reform  proposals . 

Chapter  III  reviews  risk  adjustment  modeling 
literature.   This  chapter  begins  with  definitions  of  risk  and 
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risk  adjustment  and  a  conceptual  framework"  for  risk  derived 
from  the  literature.     A  summary  of  risk  model  families, 
recent  risk  model  development  projects  in  each  of  those 
families  and  methodological  approaches  to  risk  model 
development  and  evaluation  follow.     A  summary  of  the  gaps  in 
recent  risk  adjustment  research  and  the  role  of  specific 
condition  model  research  in  addressing  that  gap  concludes 
chapter  III. 

Chapter  IV  presents  methods  used  to  develop  and 
evaluate  two  simple  specific  condition  risk  adjustment 
models  for  the  Medicare  aged.  Topics  discussed  include  data 
sources,  analytic  file  development,   sample  design,  dependent 
and  independent  variables,  model  evaluation  criteria,  model 
development  processes,   statistical  analyses  performed  and 
the  computing  environment. 

Chapter  V  reviews  findings.     Results  comparing  two 
specific  condition  models  with  an  AAPCC  proxy  and  two 
criterion  models,  the  ADG/HOSDOM  and  ADG/MDC  are  presented 
for  all  four  evaluation  criteria:   1) predictive  accuracy, 
2)  administrative  feasibility,  3)  incentives  created  by  the 
model  to  achieve  desired  objectives,  and  4)  ease  of 
manipulating  and  auditing  the  system. 


10 


\ 


Chapter  VI  discusses  implications  and  limitations  of 
findings. 

Chapter  VII  concludes  the  thesis  with  proposals  for 

further  research. 

Appendices,   a  bibliography  and  vita  follow  the  final 
chapter  of  the  thesis. 
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CHAPTER  II:     MARKET  CONTEXT  FOR  RISK  ADJUSTMENT  - 

This  literature  review  summarizes  market  forces  most 
relevant  to  risk  adjustment  of  capitated  payments  to 
health  plans.     Medicare  is  emphasized.       The  review  is 
organized  as  follows. 

First,  evolution  from  a  traditional  f ee-f or-service 
indemnity  health  care  delivery  system  towards  capitated 
managed  care  is  discussed.     Types  of  MCOs  and  growth  in 
market  share  are  reviewed  with  an  emphasis  on  growth  in 
market  share  of  prepaid  plans . 

A  discussion  of  capitation  rating  methods,  including 
how  risk  adjustment  can  be  an  integral  part  of  that 
process,   follows.     This  brief  summary  of  rating  methods 
explains  how  capitation  payment  methods  create  incentives 
for  risk  segmentation  and  selection  bias. 

Biased  selection  is  then  defined  and  related  to 
methods  for  setting  and  risk  adjusting  capitation  rates. 
Market  features  that  encourage  biased  selection  are 
summarized.  Economic  and  social  consequences  of  biased 
selection  are  then  discussed.  Recent  evidence  of  biased 
selection  is  presented  followed  by  a  discussion  of 
methods  proposed  for  minimizing  effects  of  biased 
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selection.  Reasons  for  improving  risk  adjustment  methods 
are  emphasized. 

Market  Evolution  and  Dynamics 

This  section  summarizes  literature  regarding  the 
transformation  toward  capitated  managed  care  in  the 
health  care  delivery  system.     First,     types  of 
alternative  delivery  systems  are  defined  and  related  to 
market  share  trends  for  the  market  overall  and  by  private 
and  public  sector.     Then,   the  prevalence  of  capitation 
payment  is  discussed  by  plan  type  and  market.  This 
discussion  establishes  the  market  context  before 
proceeding  to  a  description  of  capitation  rating  methods 
and  the  incentives  inherent  in  those  methods  that 
encourage  biased  selection. 

Traditional  fee -for- service    (FFS)   health  insurance 
has  dominated  the  US  health  insurance  market  since  its 
inception  in  the  1920s  and  1930s  with  establishment  of 
Blue  Cross  and  Blue  Shield.        Rising  health  care  costs 
and  access  limitations  motivated  development  of 
alternative  health  care  financing  and  delivery  systems. 
(Starr,    1982;   HCFRSS  1995)  . 


13 


The  rapid  and  persistent  rise  in  health  care  costs 
is  evinced  by  trends  in  Personal  Health  Care  Expenditures 
(PHCE)   which  include  all  purchased  health  services  and 
products  for  individual  health  care   (e.g.  hospital, 
physician,  dental,  nursing  home,  drugs  and  medical 
supplies)   and  exclude  expenditures  for  construction, 
program  administration,  government  public  health 
activities  and  research  (HCFRSS,  1995)  .     National  PHCE 
rose  from  $23.9  billion  in  1960  to  $782.5  in  1993,  an 
11.2%  average  annual  rate  of  increase   (HCFRSS,   1995) .  In 
1993  15%  of  the  US  population,   about  38  million 
Americans,  had  no  health  insurance  coverage  of  any  kind 
(HCFRSS,   1995) .     The  increasing  rate  and  magnitude  of 
these  cost  and  access  problems  engendered  alternate 
delivery  system  development  and  health  reform  debates. 

By  the  end  of  the  1970s,   roughly  90  percent  of  the 
employed  population  and  their  dependents  are  covered  by 
traditional  indemnity  health  insurance  and  5  percent  are 
enrolled  in  Health  Maintenance  Organizations   (HMOs)  with 
only  one  fifth  of  HMO  enrollees  in  open  panel  Individual 
Practitioner  Associations   (IPAs)  and  the  remainder  in 
closed  staff  or  group  model  HMOs  (Weiner,   1993)  .     By  the 
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end  of  the  198  0s,  market  share  had  migrated  away-  from 
traditional  fee  for  service  toward  a  variety  of  managed 
care  options  differentiated  from  one  another  based  on  a 
continuum  comprised  of  four  key  dimensions;  who  assumes 
risk   (sponsor,    intermediary  or  physician) ,   degree  to 
which  care  or  utilization  is  managed,   degree  to  which  a 
patient  has  a  choice  of  providers  and  degree  to  which  the 
plan  is  required  to  provide  care   (Weiner,   1993) .  For 
example,   FFS  coverage  is  at  one  end  of  the  continuum  with 
sponsor  assuming  risk,  no  care  or  utilization  management, 
maximum  patient  choice  and  no  care  delivery  requirement 
for  the  insurance  plan.     Closed,   capitated  staff  model 
HMOs  are  on  the  other  end  of  the  continuum  with  plan 
assumption  of  risk,  maximum  care  management,  minimum 
patient  choice  and  care  delivery  required  by  the  plan 
(Weiner,   1993)  . 

The  most  commonly  used    typology    of  "alternative 
health  delivery  systems"     is  one  originated    by  Group 
Health  Association  of  America  in  the  1970s  and  expanded 
by  Interstudy  in  1981    (Welch,   1990) .     This  typology 
includes  four  Health  Maintenance  Organizations  (HMO) 
models:   staff,   group,   individual  practitioner  association 
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(IPA)   and  network  that  can  be  differentiated  along 
Weiner's  continuum  discussed  above  and  by  the  nature  of 
their  financial  arrangements  with  providers    (Welch,  1990; 
Weiner,   1993) .     According  to  Welch,   staff  models  employ- 
physicians,  group  models  contract  with  group  practices, 
IPAs  establish  contracts  directly  with  physicians  and 
Network  plans  contract  with  two  or  more  group  practices 
(Welch,    1990) . 

Preferred  provider  organizations   (PPOs)   Point  of 
Service   (POS)   and  Managed  Indemnity  Plans   (MIPs)  are 
three  additional  alternatives  to  traditional  FFS 
financing  and  delivery  of  care   (Welch,   1990)  .     PPOs  are 
panels  of  physicians  under  contract  to  a  plan  to  provide 
services  on  a  discounted  fee  for  service  basis .  POS 
plans  are  a  blend  where  an  enrollee  chooses  an  HMO  or  PPO 
but  may  opt  to  use  an  "out  of  plan"  provider  and  pay 
additional  FFS  costs  for  doing  so.     MIPs  are  FFS 
indemnity  plans  with  at  least  hospital  precertif ication 
as  a  means  of  "managing"  care   (Welch,   1990) - 

When  combined  with  FFS,   the  four  HMOS,   PPOs,   POS  and 
MIPs  comprise  an  eight  point  typology  of  plans  that 
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describes  a  spectrum  of  indemnity  and  alternative 
delivery  systems   (Welch,   1990;  Weiner,  1993). 

Several  researchers  have  suggested  that  these  eight 
plan  types  are  obsolete   (Welch,    1990;  Weiner,   1993;  Gold, 
1995) .     In  effect,   the  term  "integrated  delivery  systems" 
encompasses  all  8  plan  types.     Integrated  delivery  system 
contractual  arrangements  provide  a  new  basis  for 
characterizing  plans  using  contract  features  that  drive 
cost  containment  and  quality  of  care  incentives .     Type  of 
payment   (salary,   capitation,  FFS) ,   type  of  risk  assumed 
(withhold/bonus  and/or  capitation) ,   amount  of  risk 
assumed  (%  withhold/bonus  or  %  capitation) ,  contract 
tier(s)   at  which  risk  is  assumed   (plan,  intermediary 
and/or  provider) ,  proportion  of  enrollees  or  patients  at 
risk  and  enrollee  lock.- in  provisions   (closed  or  open 
panel)   are  factors  suggested  for  alternative  typologies 
(Welch,   1990;  Weiner,    1993;  Gold,  1995). 

A  new  plan  typology  employing  some  combination  of 
these  factors  may  be  increasingly  important  as  alternate 
delivery  systems  continue  to  transform  in  all  markets. 
However,   the  eight  point  typology  described  above  is 
applied  below  for  ease  in  describing  market  penetration 
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and  payment  mechanisms  by  plan  type  since  the  vast 
majority  of  existing  literature  concerning  plan  market 
share  relies  on  some  form  of  that  typology. 

For  1991  Weiner  reports  37%  of  all  insured  Americans 
are  in  unmanaged  indemnity  plans  or  "FFS" ,     34%  in 
managed  indemnity  plans   (MIPs) ,   15%  in  HMOs  and  14%  in 
PPOs  (Weiner,   1993)  .     Those  proportions  vary  by  private 
(e.g.  employer-based  and  individual  health  insurance)  and 
public     (e.g.  Medicare,  Medicaid  and  Federal  Employee 
Health  Benefit  Program)   sector.     In  the  private  sector 
25%  are  in  FFS,   43%  are  in  MIPs,   18%  in  HMOs  and  17%  in 
PPOs.     In  contrast,   the  public  sector  distribution  for 
1991  is  89%  FFS,    4%  MIPs,    5%  HMOs  and  2%  PPOs. 

The  1994  Marion  Merrell  Dow  HMO  Digest  reports  HMO 
penetration  at  19.4%  of  the  US  Population  in  1993.  Total 
HMO  enrollment  nationwide  rose  10.4%  in  1992.       Growth  is 
greatest  in  the  public  sector.     Medicaid  HMO  enrollment 
rose  21%.     Medicare  enrollment  rose  11%. 

Among  all  HMO  enrollees  in  1993,     4  9%  are  in  IPAs, 
29%    Group,   13%  Network  and  10%  Staff  model  plans.  Group 
models  reported  an  increase  of  23%  followed  by     13%  for 
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IPAs.  Enrollment  decreases  are  reported  by  network  (13%) 
and  staff    (2%)    models.      (Marion  Merrell  Dow,  1994). 

Iglehart  reports  on  the  evolution  of  managed  care 
for  Medicaid  (Iglehart,    1995)  .         Prior  to  1993  only 
Arizona  had  clearance  for  a  Medicaid  program  based  only 
on  managed  care  plan  contracts .     Other  states  had  pilot 
projects.     In  1993,     the  President     committed  to 
streamlining  the  process  by  which  HCFA  grants  Medicaid 
program  waivers  and  the  number  of  states  with  Medicaid 
managed  care  programs  grew.     By  1995,     9  additional 
states  had  waivers  for  managed  care  programs  (Florida, 
Hawaii,   Kentucky,  Massachusetts,   Ohio,   Oregon,  Rhode 
Island,   South  Carolina  and  Tennessee)   and  11  more  states 
had  applied  (Delaware,   Georgia,    Illinois,  Kansas, 
Louisiana,  Minnesota,   Missouri,  New  Hampshire,  New  York 
Oklahoma  and  Vermont) .     An  additional  8  states  made 
inquiries  about  obtaining  waivers.       From  1993  to  1994 
the  proportion  of  Medicaid  managed  care  enrollees 
doubled.      (Iglehart,    19  95).       As  more  waivers  are  granted 
and  managed  care  contracts  are  implemented,   the  growth 
rate  for  Medicaid  managed  care  enrollees  will  increase. 
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Growth  in  MCO  penetration  of  the  Medicare  market 
lags  Medicaid  and  the  private  sector.     Medicare  policy 
permitted  contracting  with  federally  qualified  HMOs  prior 
to  1982.     By  passing  the  Tax  Equity  and  Financial 
Responsibility  Act    (TEFRA)   in  1982,     Congress  encouraged 
enrollment  in  capitated  managed  care  plans.  TEFRA 
simplified  the  contracting  process  and  allowed  bids  from 
a  broader  range  of  managed  care  plans   (i.e.,  competitive 
medical  plans   (CMPs) )    (Zarabozo,   1989) . 

Both  HMOs  and  CMPs  receive  capitated  payments  from 
Medicare  and  are  "at  risk"  for  providing  care  to  their 
enrollees.     There  are,  however,  several  differences 
between  federally  qualified  HMOs  and  CMPs.     First,  HMOs 
must  use  community  rating  to  set  rates.     CMPs  may  use 
experience  rating  to  establish  capitation  rates.  Second, 
HMOs  must  be  a  separate  corporate  entity.     CMPs  may  be  a 
line  of  business  of  another  entity.     Third,  HMOs  must 
provide  or  arrange  for  all  care  whereas  CMPs  do  not. 
Fourth,   CMPs  have  greater  flexibility  regarding  benefits 
to  be  offered  and  the  scope  of  some  required  services 
(e.g.  physician,  inpatient,   lab,  X  ray,  emergency, 
preventive  and  out -of -area  services) .     HMOs  must  provide 
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all  benefits  and  associated  scope  of  services  required  by 
law.     Finally,     CMPs  have  more  freedom  with  respect  to 
copayments  and  deductibles  than  an  HMO.    (Zarabozo,   1989) . 

Between  passage  of  TEFRA  in  1982  and  implementation 
in  1985,  HCFA  sponsored  several  TEFRA  risk  demonstration 
projects.     Demonstration  projects  are  required  to  convert 
to  TEFRA.  upon  implementation  of  regulations  in  198  5 
(Zarabozo,   1989) .      The  effect  of  TEFRA  on  Medicare 
capitated  managed  care  plan  enrollment  is  evinced  by 
enrollment    growth.     Medicare  enrollment  in  risk  plans 
rose  from  2%  in  1981    (pre  TEFRA)    to  3.7%  in  1985  (TEFRA 
demonstration  projects),  7.0%  in  1993  and  over  10.0%  by 
1995    (Wolf,    1996) . 

The  figures  reviewed  above  support  the  notion  of  a 
transition  toward  managed  care  plans  in  all  market 
segments.     Literature  reviewed  below  indicates  capitation 
is  the  key  payment  mechanism  in  the  Medicare  MCO  market. 

In  1985,   20.0%  of  all  Medicare  HMO  enrollees 
(116,608/463,984)   are  covered  by  HMOs  paid  on  a  cost 
basis   (19  93  Data  Compendium)  .     The  remainder  are  in  TEFRA 
risk  demonstration  plans.     By  February  1993,  the 
proportion  of  HMO  enrollees  covered  on  a  cost  basis  had 
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dropped  to  7.9%  and  the  proportion  of  Medicare  HMO 
enrollees  in  fully  capitated  TEFRA  Risk  plans  accounted 
for  90.8%  of  all  enrollees   (1993  Data  Compendium). 
Although  the  current  "Choice"  Demonstration  allows  some 
variation  in  payment  method  (full  capitation,  partial 
capitation  and  blended  FFS  and  capitation)   all  methods 
include  a  minimum  50%  capitation  level   (HCFA  CHOICE  RFP, 
1995) . 

Given  the  predominance  of  capitation  payment  methods 
among  Medicare  MCOs,   financial  incentives  inherent  in 
capitation  payment  methods  are  of  primary  concern  in 
Medicare  risk  adjustment.     In  order  to  better  understand 
the  nature  of  capitation  payment  incentives  and  why  they 
engender  biased  selection,   capitation  rate  setting  is 
summarized,  very  briefly,  before  discussing  biased 
selection  in  greater  depth. 

Capitation  Rate  Setting 

Risk  adjustment  of  capitation  payments  to  managed 
care  plans  is  the  central  purpose  for  the  risk  models 
developed  in  this  study.  The  objective  of  this  section  is 
to  place  risk  adjustment  in  the  context  of  capitation 
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rate  setting  methods,  not  to  present  a  definitive 
description  of  capitation  rate  setting  methodologies  or 
risk  adjustment.     This  section  achieves  that  goal  by- 
presenting  a  brief  overview  of     capitation  rate  setting 
methods,  noting  where  risk  adjustment  fits  in  that 
process,   and  identifying  the  incentives  created  in  the 
capitation  payment  mechanisms  that  encourage  biased 
selection. 

Capitation  rates  reflect  per  capita  revenue  required 
to  cover  estimated  costs  of  care  over  a  specific  time 
period  for  a  particular  market.      (Van  Steenwyk,   198  9)  .  A 
capitation  payment  to  a  health  plan  is  typically  a 
prospective  per  person  per  month   (PMPM)  payment  intended 
to  cover  all  costs  to  be  incurred  by  enrollees  in  the 
coming  year  (Sutton  1987,  Van  Steenwyk  198  9,  Wrightson 
1990) . 

Factors  comprising  capitation  rates  fall  into  one  of 
two  categories:  1)   anticipated  cost  of  health  care 
services  covered  by  the  benefit  package  and 
2)   adjustments  and  loadings   (Sutton  1987,  Van  Steenwyk 
1989,   Wrightson  1990)  . 
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Anticipated  costs  vary  depending  upon  whether  the 
benefit  package  is  comprehensive,  covers  only  a  selected 
segment  of  services   (e.g.  physician  services  and  other 
benefits  defined  under  Medicare  Part  B)   or  is  a  single 
benefit  or  "carve  out"  plan  (e.g.  pharmacy  or  mental 
health  only  coverage) .     Examples  of  services  that  may  be 
included  when  estimating  benefit  package  costs  are 
hospital  services   (inpatient,   outpatient,  emergency  room 
and  skilled  nursing  facility) ,  physician  services  (office 
visits,   inpatient  visits,  consultations,  surgery,  etc.), 
laboratory  and  X-ray  services,   mental  health,  pharmacy 
and  miscellaneous  other  services   (home  health,  ambulance, 
anesthesia  etc.). 

Adjustments  and  loadings  include     risk,  reinsurance 
premiums  and  recoveries,   coordination  of  benefits, 
copayments,   interest  income,   reserve  requirements, 
administration,  marketing,  overhead,     and  profit  (Sutton, 
1987;  Van  Steenwyk,   1989;  Wrightson,   1990) .     It  is  the 
risk  element  in  this  list  of  adjustments  and  loadings 
that  is  estimated  by  risk  adjustment  models.     The  "risk 
weight"  derived  by  the  risk  models  is  applied  to  the  base 
rate  to  adjust  the  base  up  or  down  depending  upon  the 
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risk  value  attributed  to  that  plan's  enrollees  relative 
to  the  norm  for  the  market   (Lubitz,    1987;  Weiner,   1996)  . 

The  American  Academy  of  Actuaries  succinctly 
describes  the  distinction  between  risk  adjusting  premiums 
paid  by  an  insured  and  prospective  or  retrospective 
adjustment  of  capitation  payments  to  health  plans. 


It  is  important  to  distinguish  between  up  front 
risk  adjustments  and  prospective  and/or 
retrospective  risk  adjustment  methods.     Up  front 
risk  adjustments  are  explicitly  reflected  in  the 
actual  premiums  or  contributions  paid  by  the  insured 
so  that  they  reflect  the  particular  risk 
characteristics  of  the  insured.     Prospective  risk 
adjustment  methods  apply  a  carrier  transfer  amount 
to  the  premiums  or  contributions  that  adjust  for  the 
different  relative  risks  of  the  various  carriers  in 
a  given  market.     Retrospective  risk  adjustment 
methods  develop  carrier  transfer  amount  in  a 
settlement  process,  such  as  at  the  end  of  the  year." 
(AAOA,    1993)  . 


The  focus  of  this  dissertation  is  prospective  risk 
adjustment  of  capitation  payments  to  health  plans  which 
requires  estimation  of  costs  to  be  incurred  in  a  future 
time  period.  The  same  risk  assessment  model  may  be  used 
for  prospective  or  retrospective  adjustment  of  payments 
to  health  plans.     These  methods  require  accurate  measures 
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of  the  relative  risk  of  particular  plan  enrollees  (i.e., 

groups    not  individuals)   compared  to  the  average  risk  for 

the  market.     Emphasis  on  relative  risk  of  plans  increases 

the  importance  of  group  accuracy  of  risk  models.  Up 

front  adjustment  of  premiums  paid  by  insureds  calls  for 

greater  individual  predictive  accuracy.     Since  the  focus 

here  is  prospective  risk  adjustment  to  maximize  equity  of 

payments  among  health  plans  within  a  market,  group 

accuracy  has  priority  over  individual  accuracy  . 

A    variety  of  approaches  may  be  used  to  estimate 

health  services  costs  for  the  base  capitation  rate.  Two 

common  formulae  for  calculating  per  member  per  month 

(PMPM)  health  service  costs  or  base  rate  are: 

(Expected  Aggregate  Costs/Number  of  Members) /12 

and 

Sum((Ave  Annual  Rate  per  Service  *  Unit  Cost)/12) 
The  first  formula  is  an  aggregate  cost/budget  approach 
which  divides  an  estimated  annual  total  cost  by  the 
number  of  members  and  by  month.     The  aggregate 
cost/budget  approach  is  most  typically  employed  where 
detailed  per  person  utilization  data  and  unit  costs  are 
not  available    (Van  Steenwyck,   1989)  .  The  second  formula 
is  an  actuarial  or  unit  cost  approach  most  appropriate 
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for  health  plans  that  keep  track  of  utilization  and  costs 
per  service  per  person.        (Van  Steenwyck,    1989)  .  Note 
that  both  formulas  reflect  PMPM  base  rates  for  health 
services  costs  and  do  not  reflect  any  of  the  factors 
included  in  adjustments   (e.g.  for  increased  or  decreased 
risk)   and  loadings   (e.g.  administrative  costs,  profit 
margin) .     These  formulae  calculate  average  health  service 
cost  estimates  or  base  rates  unadjusted  for  variations  in 
risk  among  the  plan  enrollees. 

Market  regulation  may  effect  the  amount  charged  by 
providers  for  services   (e.g.  government  rate  regulation, 
Medicare  hospital  DRG  payment  amounts.  Medicare  and 
Medicaid  maximum  allowable  amounts)   or  require 
modifications  to  the  amount  derived  by  the  basic 
capitation  rate  formulae   (e.g.,  Medicare's  policy  of 
paying  only  95%  of  the  base  AAPCC  rate)  .     However,  such 
regulations  do  not  effect  financial  incentives  for  risk 
segmentation  inherent  in  basic  capitation  rate  setting 
calculations  described  below. 

Health  costs  for  any  individual  may  vary  from  the 
group  mean  by  individual  characteristics,   e.g.,  age, 
gender,   health  status  and  propensity  to  use  health 
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services.     The  difference  in  actual  health  costs-  for 
enrollees  in  a  particular  plan  from  the  average  expected 
for  that  market   (e.g.  Medicare)   reflects  the  degree  and 
direction  of  risk  assumed  by  that  plan  for  their  portion 
of  the  enrollees  from  that  market.       Adjustments  for  that 
risk  are  one  factor  in  the  "adjustments  and  loadings" 
portion  of  the  capitation  rate  setting  process  described 
above , 

The  opportunity  for  managed  care  plans  to  risk 
adjust  their  base  capitation  rate  is  constrained  by 
federal  and  state  regulations    (Wrightson,   1990) .     The  HMO 
Act  of  1973  required  federally  qualified  HMOs  to  set 
rates  using  Community  Rating.     Community  Rating  prohibits 
capitation  rate  risk  adjustment  and  requires  that  PMPM 
payments  reflect  the  community  average.     Amendments  to 
the  HMO  Act  in  1981  allowed  Community  Rating  by  Class, 
where  "class"     is  defined  by  a  set  of  factors  rationally 
related  to  health  care  utilization   (e.g.,   age,  gender, 
industry,   family  composition  etc.)   but  excluding  group- 
specific  experience  measures  such  as  health  status, 
utilization  or  cost  differences.      (Wrightson,  1990). 
Amendments  to  the  HMO  Act  in  198  8  allowed  experience  data 
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(e.g.  prior  utilization,  cost  and  health  status  - 
indicators)   to  be  used  as  risk  adjusters  for  capitation 
rates.     In  effect,   adjusted  community  rating  is 
experience  rating   (AAOA,   1993;  Wrightson,   1990)  . 

The  strength  of  incentives  for  capitated  managed 
care  plans  to  engage  in  biased  selection  or  risk 
segmentation  is  inversely  related  to  the  accuracy  of 
capitation  rates,   i.e.,  the  lower  the  accuracy  the  higher 
the  incentive  for  risk  segmentation. 

Capitation  rates  are  an  estimated  average  cost  for  a 
market   (e.g.  the  Medicare  aged) .     If  capitation  base 
rates  are  unadjusted  or  inadequately  adjusted  for 
differences  in  the  health  risk  distribution  of  plan 
enrollees  compared  with  the  overall  market,   then  there  is 
a  financial  incentive  for  health  plans  to  segment  the 
market  on  the  basis  of  health  risk  by  enrolling  only  the 
healthiest  members  of  the  market  or  by  dumping  high  cost 
enrollees    (Luft,   1995)  . 

Both  capitation  base  rate  formula  shown  above, 

(Expected  Aggregate  Costs/Number  of  Members) /12 

and 

Sum ( (Ave  Annual  Rate  per  Service  *  Unit  Cost)/12), 
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calculate  estimated  mean  plan  revenue  required  per 
enrollee  to  cover  health  care  costs  for  enrollees  under  a 
given  benefit  package.     To  the  extent  that  the  actual 
revenue  requirements  vary  from  the  estimate  for  the 
entire  group  or  for  sub-groups,  plans  will  be  either 
overpaid  or  underpaid.     Potential  for  profit  or  loss 
creates  incentives  for  capitated  managed  care  plans  to 
engage  in  risk  segmentation  and  biased  selection.  The 
more  accurately  capitation  rates  reflect  actual  future 
revenue  requirements,   the  lower  the  potential  for  profit 
or  loss  and  the  lower  the  incentive  for  biased  selection. 
(Luft,   1995;  van  Vliet,  1992). 

Actual  revenue  requirements  are  directly  related  to 
the  true  risk  inherent  in  a  plan's  population.     To  the 
extent  that  adjustments  for  risk  are  permitted  by  law  and 
accurate  risk  assessment  models  exist,  precision  of 
capitated  rates  can  be  improved  and  incentives  for  biased 
selection  and  risk  segmentation  decreased. 

Amendments  to  the  HMO  act  of  1988  that  allow  prior 
experience  data     (e.g.  prior  utilization,    cost  and  health 
status  indicators)   as  risk  adjusters  enable  more  accurate 
capitation  rate  setting  by  maximizing  the  measures  that 
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may  be  used  as  risk  factors   (Wrightson,   1990)  .  Barring 
state-specific  rate  setting  regulations  curtailing  use  of 
particular  factors  as  risk  adjusters   (e.g.  New  York's 
exclusion  of  age  as  a  risk  adjuster) ,  regulatory 
constraints  limiting  accuracy  of  risk  adjustment  has  been 
removed. 

Two  factors  limiting  the  precision  of  capitation 
rates  remain:  the  predictive  accuracy  of  risk  adjustment 
models  and  the  proportion  of  variance  in  revenue 
requirements  that  is  truly  random.     Predictive  accuracy 
of  risk  adjustment  models  is  our  primary  concern. 
However,  predictive  accuracy  must  be  achieved  along  with 
the  three  other  criteria  for  an  effective  risk  adjustment 
model:     administrative  feasibility,   incentives  for  cost 
effective  practices  and  minimization  of  gaming  practices. 
These  four  criteria  are  discussed  in  detail  in  the  next 
chapter  which  reviews  risk  assessment  modeling 
literature.     This  section  introduces  those  criteria  for 
improving  the  effectiveness  of  Medicare  risk  adjustment 
models  in  order  to  reduce  incentives  for  selection  bias. 
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Selection  Bias 

This  section  describes  selection  bias  in  more  detail 
and  illustrates  how  risk  adjustment  of  capitation 
payments  mitigates  the  effects  of  selection  bias.  First, 
a  definition  of  biased  selection  is  presented  followed  by 
evidence  that  biased  selection  exists  in  the  overall 
market  and  for  Medicare.     Then,   economic  and  social 
effects  of  biased  selection  are  discussed.  Market 
features  that  encourage  biased  selection,  with  emphasis 
on  incentives  inherent  in  capitation  rate  setting 
methods,  are  then  reviewed.     A  discussion  of  methods 
proposed  for  minimizing  effects  of  biased  selection 
follows.     Reasons  for  improving  risk  adjustment  methods 
are  then  summarized. 

"Selection  bias"  is  systematic  variation  in  the 
expected  distribution  of  expenditures  for  plan  specific 
enrollees  from  the  expected  distribution  for  all 
beneficiaries  in  the  market  used  to  establish  rates. 
"Favorable  selection"  for  a  plan  is  a  concentration  of 
enrollees  in  the  lower  end  of  the  market  risk 
distribution   (i.e.  are  healthier  than  average  for  the 
market)   indicated  by  lower  than  average  health  service 
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utilization  and  costs  for  that  plan  compared  to  the 
overall  mean  for  the  market.     "Adverse  selection"  for  a 
plan  is  a  concentration  of  enrollees  in  the  upper  end  of 
the  market  risk  distribution    (i.e.   are  sicker  than 
average  for  that  market)   indicated  by  higher  than  average 
health  service  utilization  and  costs.     These  definitions 
are  a  composite  of  definitions  that  appear  in  the 
literature : 

"Adverse  selection  occurs  when  an  individual  or  group 
of  individuals  enrolling  in  a  health  plan  has  predictable 
expenses  that  are  greater  than  the  premium  charged  to 
enroll  in  that  plan.     Favorable  selection  occurs  when 
predictable  expenses  are  less  than  the  premium."  (Anderson- 
-      GF,  1986) 

"If  a  disproportionate  number  of  higher  risk  persons 
enroll  in  one.  .  .plan.  .  .  ,  that  plan  is  said  to  have  adverse 
selection.     In  contrast,  other  plans  which  predominantly 
have  lower  risk  persons  are  said  to  benefit  from  favorable 
selection."     Anderson  RV,   1991) . 

"...  selection  bias  .  .  .  happens  when  risks  are  not 
randomly  distributed  among  various  populations  ....     If  a 
disproportionate  number  of  higher  risk  persons  enroll  in 
one  of  the  plan  choices,  that  plan  is  said  to  have  adverse 
selection.     In  contrast,    . . .  plans  which  .  .  .  have  lower 
risk  persons  are  said  to  benefit  from  favorable 
selection" .      (Hornbrook,  1991b) 

Undesirable  consequences  of  selection  bias  vary 
depending  upon  the  type  of  selection. 
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Favorable  selection  may  produce  windfall  pro-fits  for 
health  plans  and  increased  total  costs  for  payers  (Lubitz, 
1987)  .    Health  plans  profit  because  payments  received 
exceed  the  cost  of  caring  for  enrollees  who  are  healthier 
than  the  average  person  in  the  community  used  to  set 
capitation  rates  (Luft,  1995)  .     Total  health  expenditures 
increase  because  payers  prepay  more  than  costs  healthier 
enrollees  would  have  incurred  in  a  f ee-f or-service  plan 
(GAO,   1994)  . 

Adverse  selection  has  other  unwanted  consequences; 
lower  quality  of  care,  financial  loss  for  health  plans  or 
decreased  access  for  vulnerable  populations (Davis,  1990; 
Hillman,  1993;  Giacomini,  1996;  Lubitz  1987) .  Enrollees 
may  receive  inadequate  treatment  because  the  cost  of  care 
they  require  exceeds  capitation  payments  (Davis,  1990; 
Kronick,  1995) .     Health  plans  may  suffer  financial  failure 
or  refuse  to  provide  coverage  for  high  cost  conditions 
(Lubitz,  1987;  Luft,   1995)  .     If  plans  do  offer  coverage 
for  high  cost  conditions,  access  for  vulnerable 
populations  may  be  limited  by  high  premiums  set  to  protect 
health  plans  from  the  effects  of  adverse  selection. 
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Selection  bias  can  be  caused  by  health  plan  -risk 
segmentation  practices,     enrollees  maximizing  return  on 
investment  for  their  health  insurance  premiums  or  both 
(Luft,   1995) .      Unadjusted  or  poorly  adjusted  per  capita 
payments  to  health  plans  give  health  plans  an  incentive  to 
increase  profits  and  avoid  financial  loss  by  engaging  in 
risk  segmentation  i.e.,  encouraging  the  healthiest  members 
of  a  group  to  enroll  and  highest  costs  members  of  the  plan 
to  disenroll. 

Several    practices  exist  that  enable  health  plans  to 
achieve  favorable  selection  through  risk  segmentation . 
Benefits  may  be  designed  to  discourage  high  cost 
enrollees (Luft ,  1995) .     Preexisting  high  cost  conditions 
may  be  excluded  from  coverage (AAOA,  1993) .  Physician 
panels  can  omit  or  limit  availability  of  specialists  that 
typically  treat  high  cost  conditions (Luft ,  1995). 
Physical  access  to  health  services  may  be  designed  in  a 
way  that  discourages  persons  with  certain  conditions  from 
enrolling  (Luft ,  1995)  .     If    high  cost  patients  do  enroll 
in  plans  engaging  in  any  of  these  practices,  they  have  an 
incentive  to  disenroll  and,  thereby,  contribute  to 
favorable  selection  for  the  plan  (Luft,  1995) . 
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Enrollee  choice  may  result  in  favorable  selection. 
Where  enrollees  have  a  choice  among  plans  that  vary  with 
respect  to  cost,  benefits  covered  and  health  care 
providers  they  will  opt  for  a  plan  that  offers  the  best 
coverages  for  their  needs  at  the  lowest  cost .  Persons 
under  treatment  for  chronic  conditions  at  time  of 
enrollment  may  want  to  remain  with  their  current  health 
care  provider.     The  chronically  ill  have  a  disincentive  to 
switch  to  a  potentially  less  costly  managed  care  option  if 
that  change  requires  switching  health  care  providers. 
This  disincentive  may  lead  to  favorable  selection  for  less 
rich  and  less  costly  plans  and  adverse  selection  for  more 
rich  and  more  costly  plans. 

There  is  recent  evidence  that  biased  selection 
exists .     Two  reports  from  the  Medicare  TEFRA  risk 
contract  evaluation  project  indicate  presence  of 
favorable  selection  for  the  Medicare  aged   (Hill,  1990; 
Brown,   1993) .     A  case  control  study  of  HMO  enrollees  and 
nonenrollees  found  that  HMO  enrollees  are  healthier  on 
average  than  nonenrollees  after  controlling  for 
differences  in  age,  gender,   institutional  status  and 
welfare  status   (Hill,   1990) .     Three  measures  supported 
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that  conclusion.     Compared  to  nonenrollees  in  the  same 
market  areas  for  the  period  immediately  preceding 
enrollment  HMOs  cost  23%  less  than  nonenrollees,  had  25% 
fewer  nondiscretionary  hospitalizations  known  to  predict 
expenditures  and  had  25%  lower  mortality  rate.  Further 
evaluation  of  the  TEFRA  risk  contracts  concluded  that  HMO 
enrollees  tend  to  be  younger,  are  more  likely  to  report 
higher  general  health  status  and  less  likely  to  have 
physical  disabilities,   chronic  conditions,   or  to  use  the 
health  system  when  they  did  not  feel  well   (Brown,   1993) . 

Hellinger  summarizes  recent  evidence  of  biased 
selection  in  Medicare  and  the  private  sector  (Hellinger, 
1995) .     In  addition  to  the  two  studies  mentioned  above, 
Hellinger  sites  six  studies  that  found  favorable 
selection  in  Medicare  HMOs     (Brown,   1987;  Riley,  1989 
1991;   Eggers,    1980  1982;  Lichtenstein,    1991) .     For  the 
private  sector,   Hellinger  reports  results  from  three 
early  studies   (Luft,   1988;  Hellinger,   1987;  Wilensky, 
1986)   and  several  later  studies   (Hadley,   1991;  Riley, 
1989  1991;  Lichtenstein,   1991;  Palenicek,   1993;  Kravitz, 
1992;  Robinson,    1993;  Wouters,   1988;  Strumisser,  1989; 
Zwanziger,   1991)   all  of  which  support  presence  of 
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favorable  selection  among  HMOs  and  PPOs  compared-  with 
indemnity  plans.     One  study  reviewed  found  no  difference 
between  indemnity  and  PPO  plans   (Hosek,   1990)  . 

Although  contradictory  findings  suggesting  presence 
of  adverse  selection  have  been  reported  (Dowd,   1996) ,  the 
preponderance  of  evidence  suggests  favorable  selection 
does  indeed  exist  in  Medicare  Risk  HMOs   (Brown,  1987 
1993;  Hellinger,   1995;  Riley,   1989  1991;   Eggers,  1980 
1982;  Lichtenstein,   1991) . 

There  are  several  ways  to  reduce  or  manage  effects  of 
biased  selection.     Special  risk  pools  can  be  established  for 
high  cost  persons,  small  employer  groups  and  individual 
insurance.    California  and  New  York  have  implemented 
legislation  that  take  this  approach  (Shewry  1996,  Padgug  1995) . 
"Carve -out"  or  single,  benefit  coverage  could  be  designed  for 
particular  diseases  or  high  cost  episodes  of  care.  Reinsurance 
and  stop  loss  or  special  risk  insurance  can  protect  health 
plans  from  financial  loss  due  to  adverse  selection. 
Retrospective  adjustments  could  be  paid  to  health  plans  with 
adverse  selection  or  high  cost  outliers.     New  York  adopted  this 
approach  in  their  recent  legislative  reforms  (Padgug,  1995) . 
Mixed  FFS  and  capitated  payment  mechanisms  could  be  developed 
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to  reduce  financial  risks  for  capitated  health  plans  (Newhouse, 
1986  1994) .  Risk  adjustment  models  for  capitated  payment  could 
be  improved  (Newhouse,   1986     1994;  GAO,    1994) . 

Each  of  these  methods  help  reduce  or  manage  effects 
of  biased  selection.     However,  it  is  unlikely  that  any  one 
approach  will  eliminate  potential  for  and  problems 
associated  with  biased  selection  (Luft,  1995)  .  Luft 
presents  a  cogent  summary  of    reasons  for  improving  risk 
adjustments  methods  even  if  other  market  reforms  are 
implemented. 

"The  only  time  risk  assessment  and  risk  adjustment 
are  not  necessary  is  when  there  is  a  unitary  plan 
enrolling  the  entire  eligible  population" .     "As  long 
as  people  are  offered  a  choice  of  health  plans, 
whether  they  be   ...  HMOs  or  other  types  of  insurance 
plans  that  accept  risk,   and  as  long  as  health  plans 
have  any  discretion  in  the  groups  or  individuals 
they  enroll,   there  must  be  methods  to  assess  and 
adjust  for  risk  differences.       ...     To  some  extent, 
health  insurance  market  rules  influence  the 
potential  for  risk  selection.     However,   even  the 
most  extensive  changes  to  allowable  business 
practices  leave  open  the  possibility  of  risk 
differences  among  plans.     Thus,   to  assure  that  all 
people  have  equal  access  irrespective  of  their  risk, 
it  is  necessary  that  risk  be  measured  and  payments 
be  adjusted  to  reflect  risk  differences.  However, 
it  is  not  necessary  that  individuals  bear  the 
financial  consequences  of  their  risk  differences. 
That  is,   all  enrollees  may  pay  the  same  community- 
rated  premium  irrespective  of  their  risk  as  long  as 
their  sponsor   (employer,  purchasing  cooperative,  or 
government)   provides  risk- adjusted  payments  to  the 
health  plan."    (Luft,   1995) . 
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Risk  Adjustment  and  Health  Reform 

Several  recent  health  reform  proposals  at  the  state  (e.g. 
California,  Ohio,  Maryland  and  Minnesota)  and  federal  level 
require  effective  risk  adjustment  models  in  order  to  achieve 
the  cost  containment,  access  and  quality  objectives.  Three 
examples  of  reform  proposals  are  discussed  below  to  illustrate 
the  need  for  risk  adjustment  in  health  reform  across  markets; 
President  Clinton's  1993  plan  (Health  Security  Act  1993), 
California's  Health  Insurance  Purchasing  Cooperatives  (HIPC) 
for  small  groups  and  individuals  (Shewry,  1996)   and  the 
Medicare  Choice  Demonstration  (HCFA  CHOICE  RFP,   1995) . 

President  Clinton's  1993  health  reform  proposal  mandated 
development  of  effective  risk  adjusters  for  purposes  of 
adjusting  rates  paid  to  plans  participating  in  regional  HIPC 
and  for  transferring  funds  between  HIPCs  based  on  differences 
in  their  risk  pools  (Health  Security  Act  193)  . 

In  1995  California  implemented  a  HIPC  for  the  individual 
and  small  group  market  (Shewry,  1996)  .     The  HIPC  is  managed  by 
California's  Managed  Risk  Medical  Insurance  Board  (MRMIB) ,  a 
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voluntary  semi -independent  state  board.     The  board  worked  with 
participating  health  plans  and  a  consulting  firm  to  develop  a 
specific  condition  or  marker  diagnosis  risk  adjustment  method 
to  calculate  fund  transfers  between  plans  based  on  plan  case 
mix  variation  from  the  norm  for  the  HIPC. 

Although  no  Medicare  reform  has  been  passed  by  Congress, 
the  Medicare  "Choices"  Demonstration  Program  initiated  in  1996 
will  test  whether  expanded  enrollment  in  prepaid  managed  care 
organizations  will  achieve  desired  cost  containment,  access  and 
quality    objectives  (HCFA  Choices  RFP,  1995) .     One  aspect  of 
this  project  is  "demonstration"  of  various  risk  adjustment 
methods  such  as  the  ADG/MDC  or  ADG/HOSDOM    models  (Weiner, 
1996)  or  the  HCC  model   (Ellis,  1996)  . 

Each  health  plan  participating  in  the  "Choice" 
demonstration  is  required  to  provide  standard  Medicare  part  A 
(hospital)  and  Part  B   (medical)  services  without  charging  any 
additional  premium  to  enrollees  for  those  services.     Each  plan 
must  have  a  pricing  process  that  addresses  three  components;  1) 
payment  methodology,  2)   risk  adjustment  methodology  and  3) 
reinsurance  arrangement  with  HCFA  (HCFA  CHOICE  RFP,   1995)  . 

Each  plan  must  submit  one  of  three  payment  methodologies; 
full  capitation,  partial  risk   (i.e.  capitation  with  risk  for 
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all  services  in  all  settings  shared  between  the  plan  and  HCFA 
based  on  risk  bands  set  about  the  capitation  amount)  or  blended 
capitation  and  f ee-f or-service  with  a  minimum  50%  capitation 
(HCFA  CHOICE  RFP,  1995)  .     All  three  methods  require  calculating 
capitation  rates  using  an  adjusted  community  rating  process 
proposed  by  the  plan.     Therefore,  the  accuracy  of  any  payment 
methods  could  be  maximized  by  using  the  best  possible  (i.e. 
more  accurate,  feasible,  incentive  appropriate  and  easily 
audited)  risk  adjustment  methodology  (HCFA  CHOICE  RFP,  1995)  . 

Participating  plans  may  use,  and  thereby  demonstrate  the 
effectiveness  of,  a  risk  adjustment  methodology  but  are  not 
required  to  do  so.     Initially,  HCFA  intended  to  offer  three 
risk  adjustment  methods  to  participating  plans  for 
demonstration  1)  the  AAPCC,  2)   the  ADG/HOSDOM  and  ADG/MDC 
(Weiner,  1996)  and  3)  Hierarchical  Coexisting  Conditions  (HCC) 
developed  at  HERI   (Ellis,  1996) .     The  ADG  models  are  not,  in 
fact,  offered  due  to  unavailability  of  software  for  plan  use 
during  the  demonstration  period  (Ingber,  1997)  . 

Finally,  plans  opting  for  the  first  payment  method,  full 
capitation,  may  enter  a  reinsurance  (outlier  payment) 
arrangement  with  HCFA.     Under  this  arrangement  HCFA  will 
provide  individual  reinsurance  at  any  level  in  excess  of 
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discretion  in  the  groups  or  individuals  they  enroll,  there 
must  be  methods  to  assess  and  adjusts  for  risk  differences." 
(Luft,   1995)  .     Without  a  better  risk  adjustment  methods, 
overall  Medicare  costs  will  increase  while  access  to  and 
quality  of  care  may  decrease. 

These  three  examples  of  proposed  and  implemented  health 
reforms  just  discussed   (Health  Security  Act  1993;  California, 
1995;  Medicare  Choices  RFP,   1996)   all  require  effective  risk 
adjustment  models  to  mitigate  the  effects  of  selection  bias  in 
their  respective  markets .     The  next  chapter  reviews  the  state 
of  the  art  in  risk  adjustment  model  development. 
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Chapter  III        Risk  Adjustment  Research 

This  chapter  is  comprised  of  six  sections .  The 
first  section  is  an  overview  which  outlines  three  major 
steps  in  the  risk  adjustment  process  and  notes  that  this 
research  addresses  the  first  two  steps  only;  risk 
assessment  modeling  and  model  evaluation. 

The  second  section  presents  a  conceptual  framework 
for  capitation  risk  adjustment  model  development  derived 
from  a  review  of  risk  adjustment  literature.  The 
concept  of  risk  is  defined  and  a  conceptual  model  that 
identifies  factors  affecting  risk  is  presented. 

The  third  section  reviews  families  of  risk 
adjustment  models.     Families  are  defined  by  risk  factor 
domains  from    the  conceptual  framework  contained  in  the 
model.     Two  typologies  from  the  literature  are  reviewed 
and  a  third  is  proposed.     Examples  of  recently  developed 
risk  adjustment  models  within  the  proposed  typology  of 
risk  model  families  are  presented  and  described  in  this 
section . 

The  fourth  section  is  devoted  to  an  overview  of  methods 
applicable  to  risk  assessment  model  development. 
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evaluation  and  comparative  performance  based  on 
evaluation  results.     Three  areas  are  discussed;  1) 
approaches  to  operational  definition  of  predictors,  2) 
techniques  for  calibrating  and  validating  a  model  and 
3)   evaluation  criteria  and  procedures. 

The  fifth  section  summarizes  results  of  recent 
research  evaluations  including  the  advantages  and 
disadvantages  of  each  family  of  models. 

The  final  section  summarizes  gaps  in  research  and 
how  this  project  contributes  to  bridging  one  gap  for  the 
Medicare  market. 
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Overview 

The  risk  adjustment  process  has  three  steps:  risk 
assessment  modeling,  model  evaluation  and  implementation  of  the 
risk  adjustment  methodology.    Figure  III-l  is  an  outline  of 
risk  adjustment  process  listing  major  subcomponents  of  those 
three  steps.     The  outline  is  derived  for  this  research  from 
examination  of  methods  applied  in  various  risk  model 
development  research  projects  (Anderson,  1990;  Ash,   1989;  Dunn, 
1995;  Ellis  and  Ash,   1995;  Ellis,  1996;  Health  Insurance  Plan 
of  California,  1995;  Kronick,  1995  1996;  Starfield,  1991; 
Weiner,  1991  1996) .     The  outline  serves  the  purpose  of  placing 
this  research  in  context  relative  to  the  overall  capitation 
payment  risk  adjustment  processes.     This  research  addressees 
the  first  two  steps  of  the  process:  risk  assessment  modeling 
and  model  evaluation.    The  remaining  sections  of  this  chapter 
review  risk  assessment  modeling  and  evaluation  literature 
address  subcomponents  of  those  first  two  steps  in  the  order 
presented  in  the  outline.  Discussion  of  the  conceptual 
framework  for  risk  adjustment  of  capitation  payments,  families 
of  models  based  on  that  conceptual  framework  and  a  description 
of  recent  models  within  each  family  begins  the  review. 
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FIGURE  III 

-1 

RISK  ADJUSTMENT  MODEL  DEVELOPMENT  PROCESS 

STEP 

1 

RISK  ASSESSMENT  MODELING 

a . 

Specify  Conceptual  Framework 

b. 

Define  Risk  Factors 

c . 

Choose  statistical  estimation  model 

d. 

Obtain  Measurements 

e . 

Calibrate  effects  of  risk  factors 

f . 

Validate  risk  factor  effects 

g- 

Refine  by  repeating  steps  la- If  as  needed 

STEP 

2 

MODEL  EVALUATION 

a . 

Predictive  accuracy 

b. 

Administrative  feasibility 

1.  Data  availability 

2 .  Cost/operational  burden 

3 .  Acceptability  to  stakeholders 

c . 

Incentives  for  cost  effectiveness 

d. 

Ease  of  manipulating  and  auditing 

e . 

Refine  by  repeating  steps  la- 2d  as  needed 

STEP 

3 

RISK  ADJUSTMENT  PROCESS  IMPLEMENTATION 

a . 

Develop  implementation  plan 

1.  Define  roles  and  responsibilities 

2.  Define  risk  adjustment  cycle  timeline 

b. 

Implement  risk  adjustment  process 

c . 

Evaluate  process 

d. 

Refine  by  repeating  steps  la- 3c  as  needed 

Conceptual  Framework 

Risk  assessment,  the  first  step  in  risk  adjustment, 

concerns  measurement  and  calibration  of  risk.     The  American 

Academy  of  Actuaries  defines  risk  assessment  as 

"a  means  of  determining  objectively  whether  an  individual 
or  group  represents  a  risk  that  is  reasonably  close  to  the 
average  and,  if  not,  of  quantifying  the  relative  deviation 
from  the  average."    (AAOA,  1993) 
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The  risk  assessment  process  begins  with  a  conceptual  framework 
for  risk  that  defines  the  concept  of  risk  and  identifies 
domains  of  factors  that  affect  the  risk. 

In  her  book  entitled  "Risk  Adjustment  for  Measuring  Health 
Care  Outcomes",   lezzoni  notes  that  the  concept  of  "risk"  is 
relative  and  inextricably  linked  to  a  specific  outcome  of 
interest.     (lezonni,  1994) .    Given  accurate  per  capita  payments 
to  health  plans  as  the  outcome  of  interest  and  Hornbrook's 
conceptualization  of  health  insurance  risk  as  a  cost 
distribution,  in  this  project  "risk"  is  variation  in  actual 
plan  expenditures  from  expected  costs  in  a  particular  market. 
Thus,  the  "risk"  involved  in  capitation  rate  setting  is  risk  of 
underpayment  or  overpayment . 

A  more  general  definition  of  "risk"  derived  from 
capitation  payment  and  health  outcomes  literature  (AAOA,  1993; 
lezonni,  1994;  Hornbrook,  1989  1991a)  is:   "   'Risk'   is  the 
degree  of  difference  from  a  standard  for  a  specific  outcome." 
This  general  definition  of  risk  is  applicable  here  and  more 
broadly  to  the  field  of  health  services  research. 
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Broad  applicability  of  this  definition  is  exemplified  by 
the  following  application  to  health  outcomes  risk  assessment. 
The  standard  of  care  for  patients  with  severe  asthma  includes 
daily  use  of  corticosteroid  inhalers  (NHLBI,  1992)  .     If  the 
outcome  of  interest  is  health  plan  compliance  to  that  standard 
of  care,  variation  below  1.0  in  the  proportion  of  severe  asthma 
patients  who  have  a  corticosteroid  inhaler  is  the  degree  of 
risk  for  undertreatment  of  severe  asthma  in  that  plan.     If  one 
wanted  to  rate  plans  with  respect  to  risk  of  undertreatment  of 
severe  asthma,  one  would  develop  and  evaluate  a  risk  assessment 
model  specifically  for  that  definition  of  risk. 

The  specific  form  of  the  more  general  definition  which 
guides  the  conceptual  framework  for  risk  assessment  in 
capitation  rate  setting  is:   "   'risk'  is  the  degree  of  variation 
from  mean  estimated  expenditures,  i.e.  deviation  from  the  base 
capitation  rate . " 

In  any  application,  the  risk  assessment  process  of 
classifying  enrollees  by  measurable  factors  affecting  risk 
begins  with  a  conceptual  framework  that  specifies  types  or 
domains  of  factors  which,   in  theory,  account  for  variation  in 
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risk.     Hornbrook  posits  variation  in  mean  health  costs  for  an 
enrollee  population  depends  on  factors  from  four  domains: 
enrollee  characteristics  (e.g.,  health  status,  demographics, 
health  beliefs,  propensity  to  use  resources) ,  community 
characteristics  (e.g.,  expectations  for  medical  care,  medical 
resource  availability,  regional  cost  differences) ,  plan 
characteristics  (e.g.,  plan  efficiency,  financing  mechanism, 
number  of    tiers)     and  provider  characteristics  (e.g.,  practice 
style,  demographics,  health  beliefs)    (Hornbrook,  1989) .  Using 
Hornbrook' s  conceptual  framework  shown  in  Figure  II I -2, 
variation  in  mean  (per  capita)  costs  for  a  given  market  is  a 
function  of  enrollees,  communities,  plans  and  providers. 


Figure  I II -2     General  Conceptual 
Risk  of  Inaccurate 

Framework  -  Basic  Research 
Capitation  Payments 

Enrollee  characteristics   > 

Variation  from 

Community  characteristics  > 

mean  (per  capita) 

Plan  Characteristics   > 

estimated  cost 

Provider  Characteristics   > 

in  a  defined  market 
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If  one  were  conducting  basic  research  to  identify  causal 
factors  effecting  risk,  one  would  fully  specify  this  conceptual 
model  in  the  hope  of  showing  the  direction  and  strength  of 
statistically  significant  relationships  between  risk  and  all 
subdomains  and  indicators  for  characteristics  of  employees, 
communities,  plans  and  providers.     Basic  research  is  primarily 
concerned  with  the  internal  validity  of  research  design  that 
can  support  identification  of  statistically  significant 
universal  causal  relationships  (Hedric,  1993) . 

Applied  research,  such  as  risk  adjustment,  differs  from 
ba_sic  research  in  that  it  addresses  a  particular  problem  by 
discovering  relationships  or  effects  of  practical  significance 
that  have  external  validity  i.e.  that  can  be  applied  in  real 
world  settings  beyond  the  research  or  study  sample  (Hedric, 
1993)  .     While  all  constructs  identified  in  Figure  III-2  would 
be  considered  in  risk  model  development,  only  those  domains, 
subdomains  and  their  indicators  that  contribute  to  increased 
accuracy  of  capitation  payments,  are  administratively  feasible, 
provide  appropriate  incentives  and  minimize  gaming  of  the 
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system  would  be  incorporated  in  the  conceptual  framework  for 
applied  risk  assessment  research. 

In  practice  (AAOA,   1993  1994;  Anderson,  1990;  Ash,  1989; 
Dunn,   1995;  Ellis  and  Ash,  1995;  Ellis,  1996;  Epstein  and 
Cumella,  1988;  Fowles,   1994  1996;  Gruenberg,  1996;  Health 
Insurance  Plan  of  California,  1995;  Hornbrook,  1991b  1991c; 
Kronick,  1995;  Lubitz,   1987;  Newhouse,  1989;  Starfield,  1991; 
Weiner,  1991  1996)  ,  capitation  risk  assessment  focuses 
primarily  on  decomposition  of  enrollee  characteristics  into 
indicators  of  risk  as  shown  in  Figure  III-3. 


53 
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Figure  III-3       Conceptual  Framework  -  Applied  Research 

Risk  of  Inaccurate  Capitation  Payments 

Enrollee    characteristics  > 

Demographics  \ 

(age,  gender)  \ 

Socio-economic  Variation 

from 

(welfare  status)                                       mean   (per  member) 

Health  Status  estimated 

cost 

(Direct  measures     -e.g.  Conditions  / 

SF-36  / 

lADLs  / 

(Indirect  measures -e .g.  Prior  Use  / 

ACGs,  DCG  / 

PACS  / 

Community  characteristics  > 

Region 

Plan  and  provider  characteristics  are  excluded  from  the 
conceptual  framework  for  applied  research  because  they 
undermine  a  key  objective  of  prospective  payments,  providing 
incentives  for  plan  efficiency  and  reducing  unnecessary  or 
undesirable  practice  pattern  variation.     By  adjusting  for 
differences  in  plan  and  provider  characteristics, 
inefficiencies  in  plan  or  provider  practices  would  be  rewarded. 
For  example,  if  a  plan  that  provides  health  care  to  a  broad 
spectrum  of  the  population  has  a  much  greater  proportion  of 
high  cost  specialists  than  a  typical  plan  and  "proportion  of 
specialists  in  the  plan"  is  a  risk  factor  included  in  the 
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model,  that  plan  would  be  rewarded  for  overstaffing  with 
specialists  and  charging  higher  rates  to  cover  their  costs  by- 
having  their  estimated  mean  cost  adjusted  upward,  whereas  plans 
with  a  more  cost  effective  staffing  mix  would  be  penalized. 

One  measure  of  community  characteristics,  geographic 
location,  is  often  incorporated  in  risk  models  to  adjust  for 
regional  variations  in  cost  of  providing  medical  services,  e.g. 
county  code  in  the  AAPCC  (Anderson,  1990)  .     Other  community 
indicators  (e.g.  health  resource  availability  such  as  hospital 
beds/1000)  are  not  typically  included  because  data  is  not 
readily  available  or,  like  plan  and  provider  characteristics, 
those  predictors  would  provide  inappropriate  incentives  if 
included  in  the  model.     For  example,  including  hospital  beds 
per  1000  population  could  be  contrary  to  the  incentives  for 
cost  effectiveness  by  rewarding  inflated  rates  due  to  excess 
hospital  capacity  or  inefficiencies  due  to  supplier  induced 
demand . 

Several  sub-domains  of  enrollee  characteristics  have  been 
incorporated  in  earlier  risk  models.     Epstein  and  Cumella 
(1988)  provide  tables  showing  predictors  used  in  42  Medicare 
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risk  modeling  research  projects  reported  between  1970  and  1985. 
Socio-demographic  variables  in  40  studies  include  age,  gender, 
income,  education,  race,  marital  status,  white  collar  status, 
social  class,  employment  status,  household  composition,  social 
isolation  and  insurance  coverage.     Prior  use  factors  from  12 
studies  include  total  expenditures,  prior  hospitalization (s) , 
prior  surgery,  readmissions  per  diagnosis,  acute 
hospitalization,  chronic  disease  hospitalizations,  no  prior 
use,  prior  hospital  days,  ambulatory  use,  hospitalizations  by 
complexity  of  diagnosis,  ambulatory  coverage  deductible  met, 
prior  physician  visits,  and  medical  relative  value  units. 
Funcitonal  Status  and  clinical  predictors  from  25  studies 
include  disability  status,  psychic  distress,  activity 
limitations,  activity  of  daily  living  score,  instrumental 
activity  of  daily  living  score,     restricted-activity  days, 
number  of  bed  days,  perceived  health,  presence  of  acute  or 
chronic  problem,  number  of  chronic  problems,  number  of  acute 
problems,  number  of  reported  chronic  conditions,  and  index  of 
physical  symptoms  (Epstein  and  Cumella,  1988)  .     None  of  the  42 
studies  reviewed  by  Epstein  and  Cummella  employed  direct 
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measures  of  specific  health  conditions  as  predictors.  Emphasis 
in  earlier  research  is  on  socio-demographics ,  prior  utilization 
and  self -reported  measures  of  general,  functional  and  social 
health  status. 

More  recent  risk  assessment  efforts  emphasize  diagnosis- 
based  direct  or  indirect  health  status  predictors  (Anderson, 
1990;  Ash,   1989;  Dunn,   1995;  Ellis  and  Ash,   1995;  Ellis,  1996; 
Fowles,  1994  1996;  Gruenberg,  1996;  Health  Insurance  Plan  of 
California,  1995;  Hornbrook,  1991b  1991c;  Kronick,  1995  1996; 
Lubitz,   1987;  Newhouse,   1989;  Starfield,   1991;  Weiner,  1991 
19,96)  .     Direct  health  status  measures  include  diagnosed 
conditions,  disability  status  and  activity  limitations 
(Gruenberg,  1996;  Health  Insurance  Plan  of  California,  1995; 
Kronick,   1996;  Ellis,  1996)  .     General  or  disease  specific  prior 
utilization  and  complex  groupings  of  diagnoses  based  on 
homogeneity  of  costs  are  more  indirect  health  status  measures 
(Anderson,   1990;  Ellis  and  Ash,   1995;  Ellis,   1996;  Kronick, 
1996;  Weiner,   1991  1996)  . 

The  majority  of  recent  research  relies  on  administrative 
claims  data  (Anderson,  1990;  Ash,   1989;  Dunn,   1995;  Ellis  and 
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Ash,   1995;  Ellis,  1995  1996;  Health  Insurance  Plan  of 
California,  1995;  Kronick,  1996;  Starfield,  1991;  Weiner,  1991 
1996)  .     A  few  studies  use  survey  data  (Gruenberg,  1996; 
Hornbrook,  1991b  1991c;  Lubitz,   1987;  Newhouse,   1989).  One 
study  compares  performance  of  existing  models  that  employ 
either  claims  or  survey  data  collected  from  the  same  population 
(Fowles,  1994  1996) .     Figure  III-4  summarizes  recent  projects 
by  type  of  predictor,     population  and  data  source  including 
models  developed  here.  Although  not  yet  in  the  literature,  the 
two  specific  condition  (SPEC)  models  developed  here,  SPEC  I  and 
SPEC  II,  are  included  in  Figure  III -4  to  aid  discussion  at  the 
end  of  this  chapter  regarding  how  they  extend  recent  research . 
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Families  of  Risk  Models 

Risk  model  families  have  typically  been  distinguished  by  the 
subdomain  of  health  status  measures  included  in  the  model  and/or 
by  data  source  for  measuring  those  indicators  (i.e.  claim  or  self- 
report) . 

Epstein  and  Cummella  (1988)  grouped  models  in  six  categories; 
perceived  health  status,   functional  health  status,  prior 
utilization,  clinical,  socio-demographic  and  other.     Perceived  and 
functional  health  status  are  both  direct  self -report  measures  of 
health  status.     Prior  utilization  (e.g.  costs  or  intensity  of 
services  is  an  indirect  or  proxy  measure  of  health  status. 
Clinical  models  use  various  measures  of  morbidity  that  may  be 
direct  (e.g.  a  simple  list  of  diagnosed  conditions)  or  more 
indirect  (e.g.  complex  groupings  of  diagnoses  based  on  homogeneity 
of  costs  such  as  DCGs,  a  combination  of  clinical  criteria  such  as 
ACGs  or  diagnosis  specific  utilization  such  as  PACs) .     The  Epstein 
and  Cummella  classification  served  for  earlier  models  that  tend  to 
differentiate  along  those  dimensions.     However,  more  recent  models 
tend  toward  combinations  of  two  or  more  of  those  categories. 
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The  GAO  references  four  risk  model  families:  clinically- 
based,  prior  utilization,  combination  and  self-report  models  (GAO 
1994)  .     The  GAO  definition  of  clinically-based  and  prior 
utilization  models  is  similar  to  that  just  described  with  the 
exception  that  PACs  is  considered  a  prior  utilization  model 
(despite  the  use  of  clinically-based  measures  of  intensity  of 
hospital  utilization)  and  DCGs  are  considered  "combination" 
models.     The  GAO  typology  is  problematic  because  family  categories 
do  not  allow  mutually  exclusive  assignment  of  new  models  (e.g. 
self -report  models  that  use  clinical,  prior  use  or  a  combination 
of  those  predictors) . 

A  typology    that  does  not  confound  data  source  (e.g.  claims 
vs.  self  report)  with  subdomain  of  health  status  measure  (e.g. 
diagnosed  conditions,  activity  limitations,  disability  etc.)  or 
complexity  of  measures  (e.g.  from  simple/direct  to 
complex/indirect) ,  similar  to  the  classification  shown  in  Figure 
III-4  above  would  simplify  classification  of  risk  models. 

If  we  focus  on  the  health  status  categories  from  Figure  III -4 
(simple/direct,  complex/indirect)  and  add  a  second  dimension  for 
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source  of  data  (self-report  or  claim)  we  obtain  a  simple  4  part 
taxonomy  of  risk  model  families  defined  in  Figure  ilI-5.  This 
taxonomy  is  used  below  to  discuss  recent  research  within  each  of 
four  families  of  risk  models;  1)    simple/direct  claim,  2) 
complex/indirect  claim,  3)   simple/direct  self -report,  4 
complex/indirect  self -report. 


Figure  III -5 

Four  Risk  Adjustment  Model  Families 
Defined 

Tvpe  of  Health  Status  Measures 

Data  " 
Source 

Direct/ 
Simple 

Indirect/ 
Complex 

Claim 

Condition  List 

Condition  Grouped 
by  Clinical  Criteria 
and/or  Cost  Levels 

Condition  Specific  Use 

Prior  Use 

Self - 
Report 

Condition  List 
ADLs ,  lADLs 
Disabilities 
SF-36 

Condition  Specific  Use 
Prior  Use 
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Although  this  typology  allows  a  more  logically  consistent 
classification  of  models  than  others  described  above,  it  requires 
rules  for  determining  when  to  classify  a  model  at  either  end  of 
the  simple/direct  to  complex/ indirect  continuum.     Several  models 
have  a  mix  of  simple/direct  and  complex/indirect  measures.  Some 
"cut  point"  for  categorization  is  needed.     Requiring  100%  of  all 
variables  to  fall  into  one  category  could  misrepresent  the  nature 
of  a  model  if  the  majority  of  predictors  fall  into  one  category. 
For  ease  in  classifying  models,  the  rule  applied  here  is 
assignment  of  a  model  to  the  most  complex/indirect  family  provided 
that  at  least  1/5  of  the  predictors  fall  into  that  category. 
Although  this  rule  is  admittedly  arbitrary,  it  serves  to  clarify 
family  designation  and  is  simple  to  understand  and  apply. 

Figure  III-6  places  recent  models  listed  in  Figure  III-4  in 
family  groups  according  to  the  definition  given  in  Figure  III-5. 
Although  not  yet  in  the  literature,  the  two  specific  condition 
(SPEC)  models  developed  here,  SPEC  I  and  SPEC  II,  are  included  in 
Figure  III -6  to  aid  discussion  regarding  how  they  extend  recent 
research . 
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Figure  III -6 

Recent  Risk  Model 

Research  Grouped  in 

Four  Risk  Model  Families 

Type  of  Health  Status  Measures 

Data 

Direct/ 

Indirect/ 

Source 

Simple 

Complex 

Claim 

SPEC  I 

HCC,   DPS,   DCG,  PACS 

SPEC  II 

ACQ,  ADG/HOSDOM, 

Marker  DX 

ADG/MDC 

Self 

PPRC,  MCBS, 

Report 

SF-36 

Recent  risk  modeling  research  will  now  be  reviewed  in  order 
by  family;  1)  Simple/Direct  Claim,  2)   Complex/Indirect  Claim,  and 
3)  Simple/Direct  Self -report.    Of  the  research  reviewed  none  fell 
within  the  fourth,  Complex/Indirect  Self -report  family,  therefore, 
only  three  of  four  families  are  discussed. 

This  section  describes  each  model  in  terms  of  their 
predictors  and  criteria  for  defining  those  predictors  where 
reported  in  the  literature.    Methods  for  model  calibration  and 
evaluation  and  comparative  results  of  each  model  are  discussed  in 
subsequent  sections. 
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Recent  research  produced  three  simple/direct  claim  models; 
California  Marker  Diagnosis  model  and  the  two  specific  condition 
models  developed  here  (SPEC  I  and  SPEC  II)  described  in  detail  in 
the  methods  chapter.     Briefly,  the  SPEC  I  model  is  a  list  of  3  0 
potentially  coexisting  conditions  indicated  by  dummy  variables 
with  the  condition  considered  present  for  an  enrollee  if  they  had 
at  least  one  diagnosis  on  any  type  of  claim  indicating  presence  of 
that  condition  in  the  prior  year.     SPEC  II  is  the  same  model  with 
the  addition  of  3  mutually  exclusive  dummy  variables  indicating 
the  level  of  prior  year  hospital  use  for  that  enrollee  (e.g.  1,   2  - 
or  3+  admissions) .     California's  marker  diagnosis  risk  adjustment 
system  for  the  State's  small  group  and  individual  health  insurance 
purchasing  cooperative  is  a  simple/direct  claims-based  model 
(Shewry,  1996;  Health  Insurance  Plan  of  California,  .1995)  .  The 
Marker  Diagnosis  Model  is  comprised  of  38  conditions  meeting  the 
following  criteria;  the  condition  is  listed  as  the  primary  or 
secondary  diagnosis  on  a  hospital  admission  and  average  annual 
costs  for  the  condition  exceed  $15,  000.     Examples  from  the  list  of 
38  marker  diagnoses  are  ulcers,  COPD,  epilepsy,  aneurysm  and  liver 
disorder.     Persons  with  more  than  one  marker  diagnosis  in  a  year 
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are  assigned  to  the  highest  cost  marker  diagnosis  only.  Marker 
diagnosis  risk  weights  are  calculated  based  on  the  average  charge 
for  each  person  with  a  marker  diagnosis  divided  by  the  average 
charge  for  all  persons.     This  model  is  relatively  simple  to 
understand  and  to  administer. 

There  are  several  recently  developed  models  in  the 
complex/indirect  claims -based  family;  HCC,  DPS,  DCG,.  PACS,  ACG, 
ADG/HOSDOM,  ADG/MDC  (Anderson,  1990;  Ellis  and  Ash,   1995;  Ellis, 
1996;  Kronick,  1996;  Starfield,  1991;  Weiner,   1991  1996) .  Models 
in  this  family  vary  in  their  complexity  and  indirectness  of  health 
status  measurement .    They  are  reviewed  here  in  order  from  least  to 
most  complex  and  indirect . 

The  Hierarchical  Coexisting  Condition  (HCC)  model  is 
developed  by  Ellis  and  others  at  HERI  and  BU  for  a  combined 
Medicare  aged  and  disabled  population  under  contract  to  HCFA 

(Ellis,  1996) .     The  project  is  concurrent  with  development  of  the 
ADG/HOSDOM  and  ADG/MDC  models  at  JHU.     Those  two  projects 
collaborated  with  respect  to  development  of  selected  data  files 
used  in  their  research  some  of  which  are  also  used  in  this  study 

(Ellis,   1996;  Weiner,  1996). 
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The  HCC  (Ellis,  1996)  measures  health  status  through  the  use 
of  several  medical  condition  markers  rather  than  the  single 
highest  cost  condition  groups  as  in  its  precursor,  the  DCG  model, 
described  in  more  detail  below.     In  the  HCC  model,  estimates  of 
future  costs  are  based  on  a  list  of  conditions  which  may  be  or  may 
not  be  present  for  each  enrollee.    Of  the  34  conditions  in  the  HCC 
model,   13  coexisting  conditions  (e.g.  high  cost  infectious 
diseases,  Protein-Calorie  Malnutrition,  Liver  Diseases, 
Cerebrovascular  Disease)  are  dummy  variables  indicating  presence 
or  absence  of  that  condition  for  each  enrollee.    The  remaining  21 
conditions  are  grouped  in  8  cost  hierarchies  (Neoplasms (4 ) , 
Diabetes (2),  Hematological (2 ) ,  Mental (2),  Neurological (2 )  ,  Cardio- 
respiratory arrest(3),  Heart(4)  and  Lung(2)). 

The  Neoplasm  hierarchy  is  divided  into  four  cost  levels .  A 
person  with  more  than  one  kind  of  cancer  diagnosis  would  be 
classified  only  once  in  the  highest  cost  level  determined  by 
diagnosis  groups.     For  example,  breast  cancer  is  in  the  lowest 
cost  hierarchy  while  metastatic  cancer  regardless  of  site  is  in 
the  highest  cost  category.    A  woman  with  both  diagnoses  would  be 
classified  in  the  highest  cost  cancer  category.    The  8 
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hierarchical  condition  groups  plus  13  conditions  result  in  a 
maximum  of  21  coexisting  conditions  for  any  given  enrollee. 

Diagnoses  are  grouped  by  a  panel  of  physicians  into  the  34 
conditions  according  to  5  criteria;  1)  heterogeneity  of  cost 
between  groups,  2)  clinical  cogency,  3)  sample  size  of  500,  4) 
combining  alternate  diagnoses  for  the  same  condition  5)  mutually 
exclusive  categories.    A  sample  size  of  500  produced  a  relative 
standard  error  for  mean  expenditures  of  about  10  percent  which  is 
deemed  sufficiently  accurate  by  the  researchers  (Ellis,  1996) . 
Clinical  cogency  takes  precedence  over  sample  size  when  the  two 
criteria  conflict.    All  diagnoses  present  on  all  claims  are  used 
to  determine  whether  enrollees  had  any  one  of  34  conditions 

(Ellis,  1996)  .     In  the  HCC  model,  estimated  costs  for  each 
enrollee  are  based  on  the  mean  for  the  population  plus  the  risk 

(cost)  weight  associated  with  each  of  a  possible  21  coexisting 
conditions  present  for  that  enrollee.    (Ellis,  1996). 

Kronick's  Disability  Payment  System  (DPS)   for  the  disabled 
Medicaid  population  is  the  second  model  in  the  complex/direct 
claims  family  (Kronick,  1996)  .     This  model  is  a  list  of  18 
conditions  corresponding  to  body  systems  or  specific  illnesses. 
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Ten  conditions  are  subdivided  into  various  mutually  exclusive  cost 
levels,  similar  to  hierarchical  conditions  in  the  HCC,  referred 
to  as  "hierarchical  categories"    (e.g.  psychiatric,  pulmonary, 
cardiovascular,  diabetes,  hematological,  substance  abuse,  mental 
retardation,  renal,  cerebrovascular  and  AIDS) .  The  remaining  8 

conditions  (CNS,  skeletal  and  connective,  gastrointestinal, 
metabolic,  cancer,  eye  and  ear,  skin  and  gynecological)  are  "fully 
counted  categories" .  Fully  counted  categories  are  also  divided 
into  cost  levels  that  are  not  mutually  exclusive  with  definition 
of  distinct  diagnostic  subgroups  within  each  level .     Each  distinct 
diagnostic  siibgroup  is  counted  separately.     "For  example,  a  person 
with  diagnoses  in  claims  for  epilepsy,  multiple  sclerosis  and 
heart  failure  would  be  assigned  a  count  of  two  low  cost  CNS 
diagnoses  and  one  medium-cost  cardiovascular  diagnosis."  (Kronick, 
1996)  . 

Kronick  reports    two  criteria  guided  grouping  of  diagnosis 
codes  into  hierarchical  categories  and  fully  counted  categories; 
1)  homogeneity  of  costs  and  2)   clinical  cogency  (Kronick,  1996) . 
Primary  diagnoses  on  ambulatory  claims  and  primary  and  secondary 
diagnoses  coded  on  inpatient  claims  are  used  to  count  conditions 
(Kronick,  1996)  . 


70 


The  most  recent  DCG  revision  (Ellis  and  Ash,   19  95) 
is  derived  from  primary  diagnosis  codes  for  hospitalizations 
collapsed  into  78  diagnostic  subgroups.     These  siibgroups  are  then 
aggregated  into  7  diagnostic  cost  groups  primarily  based  on 
similarity  of  mean  total  costs  in  the  prediction  year.     An  eighth 
diagnostic  cost  group  is  added  and  included  all  persons  with  no 
hospitalizations  and  with  very  low  cost  or  highly  discretionary 
admission  diagnoses   (Ellis  and  Ash,   1995) .     Criteria  applied  in 
developing  of  DCGs  are  1)   empirically  determined  homogeneity  of 
future  hospitalization  costs  2)   clinically  meaningful  aggregation 
of  subgroups  and  3)   clinical  discretion  concerning  diagnosis  or 
need  for  hospitalization   (Ash,  1989) . 

Four  additional  examples  of  more  complex/indirect  claims  . 
models  are  all  developed  at  Johns  Hopkins  University:  Payment 
amount  for  Capitated  systems  (PACS)    (Anderson,  1990) ,  Ambulatory 
Care  Groups  (ACGs)    (Starfield,  1991;  Weiner,  1991),  ADG/HOSDOM  and 
ADG/MDC  (Weiner,  1996) . 

The  PACS  model   (Anderson,  1990)  is  comprised  of  demographics 
(age,  sex) ,  disability  status  and  three  types  of  predictors  that. 
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in  descending  order  of  priority:  persistency,  likelihood  of  return 
office  visit,  likelihood  of  specialty  care,  need/cost  of  diagnosis 
and  treatment  procedures,  likelihood  of  hospitalization, 
likelihood  of  disability  and  likelihood  of  death.     Thus,  ADGs 
represent  presence  or  absence  of  a  group  of  conditions  deemed 
similar  with  respect  to  these  criteria.    ADGs  are  not  mutually 
exclusive.     Subsequent  steps  employed  the  same  divisive 
hierarchical  clustering  technique  used  in  developing  DRGs  to 
reclassify  ADGs  into  51  mutually  exclusive  Ambulatory  Care  Groups 
which  take  into  account  age  as  well  as  the  number,  type  and 
severity  of  conditions  grouped  together  (Fetter,  1980;  Starfield, 
1991;  Weiner,  1991) . 

The  ADG/HOSDOM  model  uses  13  of  34  ADGs,  the  initial 
"building  block"  within-  the  ACG  system,  to  indicate  the  presence 
or  absence  of  diseases  coded  in  ambulatory  and  inpatient  claims 
grouped  according  to  criteria  mentioned  above  under  the  ACG  model 
(Weiner,  1996)  .     This  model  adds  a  binary  severity  of  illness 
variable,  HOSDOM,  indicating  presence  or  absence  of    a  "hospital 
dominant"  condition,  i.e.,  one  that  is  sufficiently  severe  to 
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warrant  hospitalization  in  at  least  50%  of  individuals  with  that 
diagnosis  (Weiner,  1996) . 

The  ADG/MDC  model  uses  the  same  13  ADGs  as  predictors, 
however,  unlike  the  ADGs  in  the  ADG/HOSDOM  model  which  are  formed 
from  both  ambulatory  and  inpatient  visits,  ADGs  in  this  model  are 
formed  from  ambulatory  office  visit  claim  diagnoses  only.  Counts 
of  14  MDC  admissions,  four  of  which  are  combinations  of  two  or 
more  unique  MDCs,  are  included  as  predictors  in  this  model. 
The  range  of  complexity  in  the  complex/indirect  claims  model 
family  is  broad.     The  HCC  is  the  simplest  model  in  so  far  as  it  i& 
comprised  of  dummy  variables  indicating  presence  or  absence  of  21 
coexisting  conditions.    Hierarchical  cost  levels    for  21  diseases 
within  8  condition  groups  make  this  model  far  more  complex  than 
the  simple  direct  models.  However,  HCC  is  far  simpler  and  a  more 
direct  measure  of  health  status  than  its  DCG    precursor,  PACS  or 
any  of  the  ACG    models  all  of  which  use  complex  grouping 
algorithms  based  on  homogeneity  of  cost  and/or  clinical  factors. 

There  are  three  recent  examples  of  the  third  family, 
simple/direct  self -report  models.     Two  of  these  are  similar  to  the 
SPEC  I  and  II  models  in  that  they  are  incorporate  a  simple  list  of 
conditions  (Fowles,  1994  1996;  Gruenberg  1996)  . 
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Fowles  and  colleagues  (1994  1996)  performed  "A  Comparison  of 
Alternative  Approaches  to  Risk  Measurement"  for  the  Physician 
Payment  Review  Commission  (PPRC)  using  claims  and  survey  data 
collected  on  a  population  of  6000  adult  and  elderly  enrollees  in 
Minnesota  network  model  HMO  during  the  period  from  June  19  91 
through  June  1993.     Results  are  evaluated  for  two  complex/indirect 
models  using  ACGs  and  ADGs  and  two  simple/direct  self -report 
models:  SF-36  and  13  chronic  conditions.    The  model  with  13 
chronic  conditions  compared  favorably  to  the  more  complex  claims - 
based  models  and  is  recommended  for  use  where  claims  data  is  not 
available  (Fowles,  1994  1996)  .     When  the  13  condition  model  is 
calibrated  separately  for  the  employed  and  elderly  populations, 
two  different  models  of  10  self -reported  conditions  emerged  for 
each  population. 

Gruenberg  and  colleagues  (1996)  used  1991  and  1992  MCBS 
survey  data  to  compare  five  self -report  models;  a  health  model 
comprised  of  a  general  health  measure  and  9  self -reported 
conditions,  an  ADL  model,  a  disability  model  combing  ADLs  with 
disabilities  related  to  walking,  lifting  and  meal  preparation,  a 
comprehensive  health  model  merging  the  health  and  disability 
models  and  a  prior  use  model  that  adds  three  prior  use  measures  to 


75 


the  comprehensive  model.     The  comprehensive  model  is  recommended 
as  the  most  promising  alternative  to  the  AAPCC. 

Hornbrook  and  colleagues  developed  a  simplified  SF-3  6  risk 
assessment  model  using  survey  data  collected  from  7,744  employed 
enrollees  in  an  HMO  (Hornbrook,  1995)  .     The  SF-36  is  comprised  of 
eight  scales  measuring  different  health  status  domains;  physical 
function,  social  function,  role -function,  role-mental,  mental 
health,  energy/fatigue,  pain  and  general/perceived  health  (Ware, 
1992)  .  Four  of  these  scales  are  included  in  the  simplified  SF-36 
model   (physical  function,  social  function,  role -physical  and 
perceived  health)  based  on  consistently  statistically  significant 
effects  on  health  costs  (Hornbrook,  1995)  . 

This  overview  of  recent  models  within  family  group  focuses 
only  on  the  types  of  predictor  variables  and  criteria  used  to 
define  those  variables  where  criteria  are  reported.     The  next 
section  on  risk  adjustment  methods  reviews  techniques  applied  in 
risk  assessment  modeling  and  evaluation  drawn  from  the  same 
research.    A  subsequent  section  compares  results  for  recent  models 
most  relevant  to  this  research. 
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Model  Development  Methods 

This  section  reviews  methods  applied  in  risk  assessment 
modeling  and  evaluation  for  models  described  above 
(Anderson,   1990;  Ash,    1989;  Ellis  and  Ash,   1995;  Ellis, 
1996;  Gruenberg,   1996;   Hornbrook,   1995;   Kronick,  1996; 
Weiner,   1996)   and  projects  that  evaluate  multiple  models 
(AAOA,    1993;  Dunn,   1996;  Fowles,   1996) .  Three 
methodological  issues  are  discussed;   1)   approaches  to 
defining  risk  factor  measures,   2)   techniques  for  model 
calibration  and  validation  and  3)  model  evaluation  criteria 
and  techniques  for  measuring  how  well  those  criteria  are 
met . 

Two  approaches  are  typically  used  for  operational 
definition  of  risk  measures:  rational  and  empirical  analyses. 
The  process  is  recursive  in  nature  requiring  alternating 
application  of  judgment  and  empirical  analyses  beginning 
either  with  empirical  evidence  or  apriori  judgments.  The 
majority  of  recent  risk  assessment  modeling  research  in  the 
complex/indirect  claims  family  begins  with  apriori  clinical 
judgment  about  how  diagnoses  should  be  grouped  into  HCCs, 
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DPS,  ADGs  or  MDCs   (Anderson,  1990;  Ellis,   1996;  ICronick, 
1996;  Starfield,  1991;  Weiner,  1991  1996)  .     Development  of 
DCGs  began  with  empirical  evidence  on  the  homogeneity  of 
future  hospitalization  cost  levels  (Ash,  198  9)  . 

One  empirical  method  used  in  risk  assessment  modeling 
to  determine  the  operational  definition  of  more  complex  risk 
predictors   (e.g.  ACGs,   DCGs)   is  divisive  hierarchical 
clustering   (DHC)  .     This  technique  is  used  in  detennining  how 
to  group  variables  in  the  Diagnostic  Resource  Group 
prospective  payment  system  (Fetter,   1980) .     It  is  also 
employed  in  the  development  of  ACGs    (Starfield,   1991)  . 

This  technique  requires  a  continuous  dependent  variable 
such  as  cost  or  number  of  visits  for  which  variance  can  be 
computed.     At  initiation,   the  process  assumes  all  cases 
belong  to  one  cluster,   e.g.  one  risk  group.     That  group  is 
then  divided  into  two  groups  based  on  the  value  of  an 
independent  variable,  providing  maximum  reduction  in 
variance  since  the  last  split.     This  process  is  repeated 
until  the  variance  in  the  dependent  variable  reduced  by  the 
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split  is  less  than  the  level  specified  by  the  researcher 
(e.g.    .01),   the  last  group  produced  by  the  split  contains 
fewer  than  the  minimum  specified  by  the  researcher  (e.g.  30) 
and/or    the  final  number  of  total  clusters  or  classes 
specified  by  the  researcher  is  obtained  (Weiner,   1991) . 
Depending  upon  the  software  used  to  implement  divisive 
hierarchical  clustering,   at  each  "split",  the  split  variable 
can  be  determined  empirically  or  specified  by  the  researcher 
based  on  apriori  judgment.     The  DRG  and  ACG  development 
projects  used  Yale's  AUTOGROUP"  software   (Fetter,   198  0; 
Theriault,   1979;  Weiner  1991) . 

Once  predictors  have  been  defined,   statistical  methods 
for  calibrating  and  validating  the  model  are  then  applied. 

Estimation  of  risk  assessment  models  in  the  development 
process  has  most  often  been  accomplished  by  the  use  of 
ordinary  least  squares    (OLS)   regression  (Anderson,  1990; 
Ash,   1989;  Ellis  and  Ash,   1995;  Ellis,   1996;  Gruenberg, 
1996;   Hornbrook,    1995;   Kronick,    1996;  Weiner,    1991  1996). 
The  statistical  model  for  OLS     regression  is 

Y  =  A  +  BiXi  +  E 
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subject  to  the  following  assumptions: 

1.       linearity  of  the  relationship  between  future  cost 
(Y)   and  risk  factors  (X) 

2-       independence  of  risk  factors    (X)   from  the  error  term 
(E) 

3 .  normal  distribution  of  costs   (Y)   for  each 
risk  factor  (X) 

4.  equal  variance  in  the  cost  error  term  (E)   for  each 
risk  factor  (i.e.  heteroscedasticity) . 

With  large  numbers  of  cases,   OLS  regression  is  quite 
robust  with  respect  to  violation  of  these  assumptions.  Many 
recent  risk  modeling  projects  use  over  100,000  cases  for 
model  development    (Anderson,   1990;  Ash,   1989;  Ellis  and  Ash, 
1995;   Ellis,    1996;  Weiner  1996). 

Even  though  OLS  regression  is  robust,  health  care  cost 
distributions  are  highly  skewed.     A  study  of  US  health 
expenditure  in  1987  found  the  top  1%  of  the  population 
accounts  for  3  0%  of  expenditures  and  the  bottom  50%  account 
for  3%   (Berk  and  Monheit,   1992).       Various  techniques  have 
been  applied  to  address  this  issue  using  a  single  OLS 
regression. 


80 


One  approach  is  to  perform  a  logarithmic  transformation 
of  the  dependent  cost  variable  and  a  retransf ormation  of  the 
estimates.     Some  studies  have  estimated  models  using  this 
technique   (Fowles,    1994;   Ellis,    1996).     However,  use  of 
retransf ormed  estimates  causes  problems  for  interpreting 
results   (Dunn,   1995)  ,   has  less  intuitive  appeal   (Dunn,  1995) 
and  is  less  comprehensible  for  some  when  applied  in  the  risk 
adjustment  process   (Dunn,   1995;   Fowles,  1994) .  Several 
recent  studies  truncate  the  dependent  variable  at  a  dollar 
value  representative  of  reinsurance  levels  for  that  market 
(Anderson,   1990;   Dunn,    1995;   Ellis,    1996;   Fowles,  1994; 
Weiner,   1996)    and/or  use  untransf ormed  total  expenditures 
(Dunn,   1995;   Ellis,    1996;   Fowles,    1994;  Hornbrook,  1995; 
Kronick,   1996;  Weiner,   1996).     Arguments  for  using 
untransf ormed  and  untruncated  total  expenditures  include 
accurate  representation  of  the  total  underlying  risk  and 
lack  of  sensitivity  of  prediction  to  outliers  (Hornbrook, 
1995)  . 
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Two  part   (or  multiple  part)  modeling  has  been  suggested 
as  another  technique  for  dealing  with  skewed  cost 
distributions  in  risk  assessment  models  for  health 
expenditures  and  for  improving  accuracy  of  prediction 
(Dowd,    1996;   Fowles,   1996;  Robinson,   1991) .     Two  part 
modeling  entails  modeling  the  probability  of  any  expenditure 
for  selected  risk  factors  followed  by  modeling  the  effect  of 
selected  risk  factors  on  expenditures  for  enrollees  with 
positive  expenditures.     However,  the  second  part  of  the 
process  still  has  a  skewed  distribution  that  must  be 
addressed  using  the  options  indicated  above.     Objections  to 
log  transformation  noted  above  would  still  apply  as  would 
arguments  for  truncation  and  use  of  total  expenditures. 

This  dissertation  project  adopts  the  OLS  regression 
method  using  untransf ormed  and  truncated  expenditures  as  the 
dependent  variable  for  reasons  discussed  above,   following  a 
majority  of  the  most  relevant  recent  research  (Dunn,  1995; 
Ellis,    1996;   Fowles,    1994;  Hornbrook,   1995;  Kronick,  1996; 
Weiner,    1996) . 

Model  validation  is  the  application  of  a  'frozen'  model 
to  a  new  population  that  assesses  the  external  validity  of 
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the  model   (Harrell,    1996) .     There  are  three  principal 
techniques  for  validation;   split - sample ,  cross-validation 
and  bootstrapping  (Harrell,    1996;   Picard,   1990;  Efron  and 
Gong,   1983)  . 

Split-sample  validation  involves  random  assignment  of 
cases  from  the  full  sample  to  a  "training"  or  development 
split   (e.g.   1/2-2/3  of  the  cases)   and  a  validation  split 
comprised  of  the  remaining  cases .  Parameter  estimates  from 
the  training  samples  are  applied  to  predict  the  validation 
sample.     Predictive  accuracy  is  then  calculated  as  an 
indicator  of  external  validity.     Split-sampling  has  two 
potential  disadvantages;  it  requires  a  relatively  large 
sample  to  ensure  unbiased  parameter  estimation  and  accuracy 
of  estimates  can  vary  widely  between  the  training  and 
validation  splits  (Harrell,   1996).     However,   when  sample 
size  permits  and  a  single  split  is  reasonable  with  respect 
to  geography,   location  and  time,   the  split-sample  technique 
has  the  advantage  of  being  a  simple  and  stringent  validation 
of  generalizability   (Harrell,   1996)  .  Split-sample 
validation  is  applied  in  the  majority  of  claims-based  risk 
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assessment  modeling  where  large  samples  are  relatively 
easily  obtained   (Anderson,   1990;  Ash,   1989;  Ellis  and  Ash, 
1995;   Ellis,    1996;   Kronick,   1996;  Weiner,  1996). 

Cross-validation  is  split -sampling  that  retains  a  large 
proportion  of  cases   (e.g.   95%)   in  the  training  split  and  is 
repeated  hundreds  of  times  with  results  averaged  over  the 
repetitions   (Harrell,   1996) .     Like  split  sampling,  accuracy 
of  estimates  can  vary  widely  between  estimates  but  cross- 
validation  can  be  performed  with  much  smaller  sample  sizes 
(Harrell,   1996).     Cross-validation  is  not  performed  in  the 
risk  assessment  modeling  projects  reviewed  which  favored 
either  split-sample  validation  or  bootstrapping. 

Bootstrapping  is  a  technique  that  is  particularly 
useful  with  smaller  sample  sizes  and,  while  it  entails 
modeling  a  large  number  of  random  samples  with  replacement 
from  the  original  sample,   it  requires  far  fewer  repeated 
samples  than  cross-validation  and  yields  relatively  unbiased 
estimates  of  predictive  accuracy  with  low  variance  (Harrell, 
1996) .     Bootstrapping  has  been  applied  in  some  recent  risk 
assessment  modeling  research  where  study  sample  sizes  are 
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relatively  small   (e.g.  under  10,000)    (Fowles,   1994  1996; 
Hornbrook,   1995)  .     Boostrapping  has  also  been  used  in 
projects  with  large  sample  sizes  to  test  model  performance 
at  different  enrollee  group  sizes    (Weiner,    1996)  . 

This  dissertation  project  adopts  split-sample 
validation  because  it  is  most  appropriate  for  the  large 
sample  size  in  this  study  and  is  consistent  with  the  two 
most  relevant  recent  projects  that  use  the  same  samples  and 
data   (Ellis,   1996;  Weiner,    1996)  . 

Techniques  for  calculating  predictive  accuracy  for  the 
validation  sample  are  discussed  in  the  next  section  since 
predictive  accuracy  is  the  principal  evaluation  criteria  for 
capitation  payment  risk  modeling. 

Evaluation  Criteria  and  Methods 

Four  criteria  commonly  applied  in  evaluation  risk 
assessment  models  are  1)  predictive  accuracy,  2) 
administrative  feasibility,   3)   ease  of  auditing/gaming  and 
4)   incentives  for  cost  effective  care   (Anderson,   1990;  Dunn, 
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1995;   Ellis,    1996;   Fowles,   1994;  Newhouse,    1986;  Weiner, 
1996)  . 

Risk  assessment  researchers  differ  over  whether 
individual  or  group  predictive  accuracy  is  more  important 
when  evaluating  risk  models   (Dunn,   1995) .     The  relative 
importance  of  the  measures  varies  depending  on  the 
objectives  for  the  risk  measure   (Dunn,   1995) .     If  the 
primary  objective  is  equitable  distribution  of  contributions 
to  plans  thereby  minimizing  incentives  for  risk  selection, 
then  group  predictive  accuracy  is  of  higher  priority, 
"...concern  about  the  poor  explanatory  power  of  models 
focusing  on  individual  expenditures  is  misplaced  if  the  real 
question  is  how  well  one  can  predict  for  groups, 
particularly  groups  reflecting  nonrandom  risk-related 
selection."    (Luft,    1995).  • 

Predictive  accuracy  at  the  individual  level  is  also  of 
interest  because  it  minimizes  incentives  for  selection 
within  risk  groups  at  enrollment  or  through  disenrollment 
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(Newhouse,   1989) .     This  issue  is  particularly  important  for 
the  most  vulnerable  high  cost    individuals  and  those  with 
chronic  conditions   (Anderson,   1990;  Kronick,  1995). 
However,   the  importance  of  individual  predictive  accuracy  to 
minimize  within  risk  class  selection  incentives  necessarily 
differs  with  the  degree  of  homogeneity  or  variance  within 
and  the  definition  of  those  classes.     Theoretically,  more 
specific  and  narrowly  defined  risk  classes  have  more 
homogeneity  within  class  and,  consequently,   less  opportunity 
for  within  class  risk  selection  rendering  individual 
predictive  accuracy  less  important.     Risk  classes  formed 
using  chronic  or  high  cost  conditions  also  decrease  the 
chance  of  discrimination  against  individuals  with  those 
conditions  and  provide  incentives  for  health  plans  to  enroll 
such  members   (Kronick,   1996) . 

The  primary  purpose  of  risk  adjusters  for  capitation 
payments  is  to  set  correct  payment  rates  for  groups  of 
enrollees  that  minimize  incentives  for  selection  bias 
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(Hornbrook,    1995;  Luft,   1995) .     Although  both  types  of 
predictive  accuracy  are  important,  group  predictive 
accuracy,  particularly  for  systematically  biased  groups 
biased  with  respect  to  enrollee  characteristics  that  could 
be  known  by  a  health  plan  and  used  to  segment  a  market  based 
on  risk,   is  the  most  critical  criteria  for  determining  the 
predictive  accuracy  of  risk  adjustment  models  (Hornbrook 
1995,  Luft  1995) .     For  that  reason,   although  several  methods 
used  in  recent  research  are  described  below  to  provide  a 
balanced  view  of  current  methods,  only  the  adjusted  R2 
measure  is  adopted  for    this  dissertation  research. 
Individual  predictive  accuracy  of  models  can  be  assessed  and 
compared  using  the  following  measures;  adjusted  R2  and 
maximum  R2    (Hornbrook,   1995) . 

The  adjusted  R2  statistic  indicates  the  amount  of 
variance  in  the  dependent  variable   (e.g.  health 
expenditures)   explained  by  the  predictors  in  the  model 
adjusting  for  the  number  of  predictors  which  enables 
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comparison  across  models  with  varying  degrees  of  freedom. 
The  amount  of  variance  in  expenditures  explained  varies  by 
model  and  population  to  which  the  model  is  applied. 
Adjusted  R2  varies  from  .01  to  .12  for  most  recent  research 
(Anderson,    1990;  Ash,   1989;  Dunn,    1995;   Ellis  and  Ash,  1995; 
Ellis,    1996;   Fowles,   1994  1996;  Hornbrook,   1995;  Weiner, 
1996)  . 

Maximum  R2  is  the  estimated  maximum  amount  of  variance 
that  could  be  explained  assuming  measurement  error  in  the 
dependent  and  independent  variables  is  0  and  only  random 
(i.e.  unpredictable  error)   remains  in  the  error  term. 
Estimates  of  maximum  R2  for  capitation  risk  assessment 
modeling  range  from  .15  to  .20   (Newhouse,   198  9;  van  de  Ven, 
1994;  Welch,   1985).     Although  the  concept  of  maximum  R2  has 
a  certain  appeal  for  evaluating  the  relative  performance  of 
models  for  individual  predictive  accuracy,   it  is  less 
readily  comprehensible  and  can  lead  to  confusion  if  readers 
interpret  that  value  as  R2 . 
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The  vast  majority  of  recent  research  reports  the 
adjusted  R2  as  a  measure  individual  predictive  accuracy 
(Anderson,    1990;  Ash,    1989;  Dunn,   1995;  Ellis  and  Ash,  1995; 
Ellis,   1996;  Gruenberg,   1996;  Hornbrook,   1995;  Kronick, 
1996;  Weiner,   1996)  . 

Dunn  et .  al .    (1995)   use  additional  measures  of 
individual  predictive  accuracy: 

Mean  absolute  prediction  error 

Standard  deviation  of  mean  absolute  prediction  error 
Percent  of  absolute  prediction  errors  near  the  mean 
Percent  of  absolute  prediction  errors  near  the  tails 

These  measures  of  individual  accuracy  augment  adjusted  R2 
by  providing  information  related  to  two  limitations  of  R2  as 
a  summary  measure   (Dunn,   1995)  .     First,  R2  is  effected  by 
outliers.  Comparing  the  mean  absolute  prediction  error  is 
one  way  to  determine  the  relative  effect  of  outliers  on 
different  models  applied  to  the  same  data;   the  greater  the 
effect,  the  higher  the  mean  absolute  error.     Second,  R2 
gives  no  indication  of  the  distribution  of  prediction  errors 
e.g.  a  lot  of  moderate  errors  or  a  few  large  errors  with 
many  small  errors.     Comparing  the  standard  deviation  of  the 
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absolute  prediction  error,   the  percent  of  absolute 
prediction  error  close  to  the  mean  and  in  the  tails  of  the 
distribution  together  provide  an  indication  of  whether 
models  achieve  predictive  accuracy  evenly  across  the 
distribution  or  whether  they  perform  better  in  the  center  or 
toward  the  extremes  of  the  distribution  (Dunn,  1995)  . 

There  is  no  standard  for  measuring  group  predictive 
accuracy   (Dunn,   1995;   Fowles,   1994) ,   the  most  important 
measure  for  capitation  risk  assessment  models  (Hornbrook, 
1995;  Luft,   1995).     Several  methods  for  assessing  group 
accuracy  are  described  below  to  provide  a  balanced  review  of 
current  methods.     However,   only  the  predictive  ratio  is 
adopted  in  this  dissertation  research  because  it  is  the  most 
widely  used  measure,   is  common  to  recent  research  most 
relevant  to  this  project   (Ellis,   1996;  Weiner,   1996)   and  is 
easy  to  calculate  and  to  interpret. 

Group  predictive  accuracy  has  most  often  been  measured 
by  a  Predictive  Ratio   (PR)    (Anderson,   1990;  Ash,    1989;  Dunn, 
1995;   Ellis,    1996;   Fowles,    1994;   Kronick,    1996;  Weiner, 
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1996) .  The  PR  is  calculated  using  the  formula  (mean 
predicted  expenditures /mean  actual  expenditures)  (Anderson, 
1990;  Ash,    1989;  Ellis  and  Ash,    1995).     A  value  of  1.0 
reflects  perfect  accuracy.     Values  above  1  indicate 
overprediction.    Values  less  than  1  indicate 
underprediction.     Although  there  is  no  standard  for 
determining  what  range  of  PR  indicates  acceptable  accuracy, 
researchers  tend  to  use  a  5%  or  10%    band  about  1.0   (i.e.  PR 
=0.95  to  1.05)  as  an  indicator  of  acceptable  accuracy  for 
capitation  risk  models   (Fowles,   1994;  Weiner,  1996)  .  The 
predictive  ratio  is  both  simple  to  calculate  and  to 
comprehend  as  a  relative  risk  measure  for  accuracy  of 
capitation  payments. 

Grouped  R2    (Ash,    1989;   Hornbrook,    1995;   Luft,    1995)  is 
a  second  measure  of  group  predictive  accuracy.       Ash  et.al. 
developed  one  concept  of  group  R2  as  an  analog  to  individual 
R2  to  assess  predictive  accuracy  of  models  at  the  group 
level   (Ash,   1989;   Ellis  and  Ash,   1995)  .     Ellis  and  Ash 
(1995)  provide  a  detailed  description  of  this  formula: 
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The  grouped  R2  for  a  partition  of  a  population  into 


subgroups   (risk  classes)  defined  (Ellis  and  Ash,   1995)  as: 

Grouped  R2=1-GSS (Model) /GSS (Total) 

Where  GSS (Model)    =     sum  over  subgroups  of 

the  partition  of 
(#  of  people  in  the  subgroup)  * 
(average  predicted  for  subgroup 

-  average  actual  for  subgroup)^ 
and  GSS  (Total)  =          sum  over  subgroups  of 

the  partition  of 

(#  of  people  in  the  subgroup)  * 

(average  predicted  for  subgroup 

-  average  actual  for  population)^ 

Hornbrook  et.al.   apply  an  alternate  Grouped  R2 
technique  following  Luft  and  Rosencrantz   (1993) .     Here,  one 
sorts  predicted  values  obtained  in  the  validation  sample  and 
partitions  the  sorted  observations  into  equal  sized 
subgroups   (e.g.  Hornbrook  1995  used  50  subgroups)  simulating 
groups  biased  with  respect  to  predicted  expenditures.  Mean 
actual  expenditures  are  then  regressed  on  mean  predicted 
expenditures  for  each  subgroup  to  obtain  this  version  of 
Grouped  R2   (Hornbrook,   1995) . 
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Assuming  the  Ash  approach   (Ash,    1989;  Ellis  and  Ash, 
19  95)  uses  the  equivalent  of  actuarial  risk  classes  and/or 
systematically  biased  subgroups  formed  from  knowledge  of 
enrollee  characteristics  that  could  be  the  basis  for  risk 
segmentation,   it  provides  a  particularly  relevant  predictive 
accuracy  measure  since  the  primary  purpose  of  risk  adjusted 
payments  to  plans  is  to  remove  financial  incentives  for  risk 
segmentation.  The  Rosencrantz  and  Luft    (1993)  application 
differs  considerably  in  that  it  partitions  subgroups  on  the 
basis  of  predicted  values  which  could  not  be  known  at 
enrollment  and,   therefore,   could  not  be  used  to  segment  the 
market  based  on  risk.     However,    this  method  is  useful  as  an 
analog  to  the  Hosmer  and  Lemeshow  goodness  of  fit  Chi  Square 
statistic  for  logistic  models    (Hosmer,    1991)   calculated  by 
partitioning  the  predicted  odds  into  deciles  and  calculating 
Chi  Square  for  those  10  partitions. 

Additional  measures  of  group  predictive  accuracy 
employed  in  various  studies   (Dunn,   1995;  Fowles,  1995) 
include : 
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Mean  absolute  prediction  error 

Standard  deviation  of  the  absolute  prediction  error 
Percent  of  absolute  prediction  errors  within  5  % 
Percent  of  absolute  prediction  errors  within  10% 
Mean  absolute  percent  prediction  error 
Percent  mean  squared  error 

The  first  four  of  these  measures  are  essentially  the  same  as 
those  used  for  individual  predictive  accuracy  described 
above  and  are,  therefore,     more  a  reflection  of  individual 
predictive  accuracy  than  group.     The  mean  absolute  percent 
prediction  error  and  percent  mean  squared  error  are 
standardized  indicators,  again,  based  on  individuals  as  the 
unit  of  analysis  and,   therefore,  more  a  measure  of 
individual  than  group  accuracy  (Hornbrook,   1995) .     If  these 
statistics  used  risk  class  or  systematically  biased  groups 
as  the  unit  of  analysis  and  used  predicted  and  actual 
measures  of  central  tendency,  then  they  would  be  reflective 
of  group  accuracy. 
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Evaluation  Findings 

Findings  regarding  predictive  accuracy  from  selected 
studies  are  now  summarized  before  discussion  of  how  recent 
models  compare  with  respect  to  the  remaining  three 
evaluation  criteria. 

Findings  from  five  studies  are  most  relevant  to  this 
research  either  because  they  apply  to  the  Medicare 
population   (Ellis,   1996;  Gruenberg,   1996;  Weiner,    1996)  and 
use  the  same  data  as  this  research  for  model  development  and 
evaluation  (Ellis,   1996;  Weiner,    1996)     or  because  they 
examined  models  containing  simple  lists  of  specific 
conditions   (Fowles,   1994;  Gruenberg,   1996)  .     The  five 
studies  reviewed  are  the  JHU  project  that  developed  the 
ADG/HOSDOM  and  ADG/MDC  models    (Lewin  and  JHU,   1995;  Weiner, 
1996) ,  the  HERI  study  that  developed  the  HOC  model  (Ellis, 
1996) ,   the  PPRC  study  that  compared  ADG-based  models  with  a 
model  comprised  of  13  self -reported  conditions   (Fowles,  1994 
1996)   and  Gruenberg' s  risk  model  from  the  MCBS  (Gruenberg, 
1996)  . 
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Interpretation  of  predictive  accuracy  findings  must  be 
made  with  caution  because  some  of  these  studies  employed 
populations   (Fowles,   1994)   and  dependent  variables 
(Gruenberg,   1996)   that  differ  from  the  remaining  studies. 

Figure  III -7  summarizes  findings  for  three  of  these 
studies   (Ellis  1996,   Fowles  1994  1996,  Weiner  1996) . 
Unfortunately,   the  PPRC  study  did  not  use  predictive  ratio 
as  a  measure  of  group  accuracy  (indicated  by  N/R  (not 
reported)   in  Figure  III -7) .     Nor  are  group  predictive 
accuracy  measures  employed  reported  consistently  across  all 
models  compared  in  that  study.     Therefore,  no  specific 
measures  of  group  predictive  accuracy  are  shown  for  the  PPRC 
study  in  figure  III -7.     However,   the  PPRC  report  did 
summarize  predictive  group  accuracy  as  comparable   ("good  for 
all  groups  tested")   for  the  ADG  and  Chronic  Conditions 
models   (Fowles  1994) . 
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FIGURE  III-7 
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Figure  I II -7  suggests  that  models  with  simpler  and  more 
direct  measures  of  health  status   (e.g.  HCC  compared  to 
ADG/HOSDOM  and  ADG/MDC,   and  PPRC  13  Cond.   compared  to  PPRC 
ADG)  achieve  comparable  or  better  individual  predictive 
accuracy  than  the  more  complex  indirect  models  for 
untruncated  and  truncated  dependent  variables.     For  example, 
with  no  truncation,  HCC  adjusted  R2=.081  compared  to  .063 
(ADG/MDC)   and  .055    (ADG  HOSDOM)   for  the  more  complex  models. 
The  same  pattern  applies  for  truncation  at  $50k;.     The  same 
three  models  perform  equally  well  for  random  groups  of  large 
sample  size    (>  50,000)   with  R2=1.0. 

Comparison  across  the  three  models  reporting  predictive 
ratios  for  groups  biased  with  respect  to  1991  expenditure 
quintiles  and  number  of  hospitalizations  is  somewhat 
problematic  because  the  ADG-based  measures  are  calculated 
for  the  Medicare  aged  only  and  the  HCC  measure  is  calculated 
for  the  aged  and  disabled  combined.     With  that  limitation  in 
mind,  we  note  that  all  three  models  overpredict  the  lowest 
60%  of  costs  with  the  HCC  least  accurate  and  the  ADG/HOSDOM 
most  accurate .     The  ADG/HOSDOM  and  HCC  models  are  comparable 
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for  the  top  40%  of  the  cost  distribution;  most  accurate  for 
the  fourth  quintile  and  least  accurate  for  the  top  quintile. 

Of  the  three  models  compared,   the  HCC  model  is  the  only- 
model  with  no  prior  hospital  use  indicator.     This  confirms 
findings  from  other  research  that  prior  use,  particularly 
hospitalization,   is  a  good  predictor  of  future  use 
(Anderson,   1990;  Epstein  and  Cummella,   1988;  Gruenberg, 
1996) -     Perhaps  prior  use  should  be  reconsidered  for 
inclusion  in  risk  assessment  models  if  appropriate  checks 
and  balances  can  be  included  in  the  risk  adjustment  process 
to  control  for  gaming  and  disincentives  for  cost  effective 
care . 

Gruenberg' s  comprehensive  model  is  not  included  in 
Figure  III -7  because  the  dependent  variable  in  this  study  is 
the  cost  ratio   (Individual  Expenditure/Mean  Expenditure)  and 
there  are  no  common  measures  of  group  predictive  accuracy. 
The  adjusted  R2  for  the  comprehensive  model   (9  self  reported 
chronic,   3  ADL  and  3  Disability  measures)    is  0.06, 
comparable  to  the  amount  of  variance  explained  in  the 
untruncated  ADG/HOSDOM,  ADG/MDC  and  HCC  models.  Although 
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Gruenberg  did  not  report  predictive  ratios,     he  offers 
tables  showing  predicted  and  observed  cost  ratios  obtained 
for  groups  biased  with  respect  to  quartiles  of  expected  cost 
ratios  calculated  using  the  comprehensive  model,   the  top  5% 
of  expenditures,   four  health  conditions,   one  ADL,  one 
disability,   and  three  measures  of  prior  use.     For  all  biased 
subgroups,   except  those  based  on  prior  utilization, 
predicted  costs  are  within  5%  of  actual  costs.     This  finding 
supports  the  PPRC  study  conclusion  that  predictive  accuracy 
of  self -reported  chronic  condition  models  is  comparable  to 
that  of  the  more  complex/indirect  claims-based  ADG  model 
(Fowles,   1994) .     However,   Gruenberg  concludes  that  it  is 
necessary  to  include  severity  of  illness  measures,   such  as 
activity  limitation  and -disability  indicators,   along  with 
diagnostic  variables  to  adequately  fit  a  model . 

The  other  three  criteria  for  evaluating  risk  assessment 
models   (administrative  feasibility,  ease  of  gaming/auditing 
and  incentive  for  cost/ef f ective  care)     are  more  subjective 
and  less  quantifiable  than  predictive  accuracy. 
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Administrative  feasibility  pertains  to  the 
administrative  burden  for  the  organizations  that  must 
implement  the  risk  assessment  model.     There  are  several 
questions  relevant  to  administrative  feasibility.     Does  the 
data  source  exist?     How  much  time,  effort  and  money  will 
data  collection  require?     Can  the  process  be  accomplished  in 
a  timely  fashion  and  within  the  time  constraints  required 
for  payment  transfers  between  plans? 

There  is  general  agreement  that  claims-based  systems 
are  comparable  with  respect  to  administrative  feasibility. 
Where  claims  data  exist,   they  are  relatively  cost  effective 
because  they  leverage  existing  data  (Dunn,   1995;  Ellis, 
1996;  Fowles,   1994  1996;  Weiner,   1996) .     However,  claims 
data  are  typically  not  available  for  new  plan  enrollees  and 
claims  do  not  provide  a  means  for  capturing  unmet  demand 
(Fowles,    1994;  Gruenberg,    1996;  Hornbrook,   1995) .  Start-up 
costs  for  claims -based  measures  are  high  where  the  data  does 
not  exist   (Fowles,   1994) . 

Survey-based  measures  from  a  representative  sample  of  a 
population  incur  costs  similar  to  existing  claims-based 
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systems   (Fowles,   1994).     Compared  to  claims-based  measures, 
self -report  measures  have  the  advantage  of  providing 
comparable  information  for  new  and  existing  enrollees  and 
tapping  unmet  demand  (Gruenberg,   1996;  Hornbrook,  1995). 
Ease  of  gaming  and  auditing  varies  depending  upon  the  data 
source  and  complexity  of  the  health  status  measure . 

Claims-based  models  that  use  diagnosis  codes  to  define 
predictors  are  subject  to  "upcoding"  practices   (Ellis,  1996; 
Fowles,   1994;  Weiner,   1996).     However,   "upcoding"  can  be 
managed  by  periodically  recallibrating  risk  weights  (Ellis, 
1996)  .  Self -report  measures  may  be  subject  to  influence  by 
health  plans   (i.e.   "upreporting" )    through  practices  such  as 
biased  selection  of  survey  respondents,   instrument  bias  and 
coaching  (Fowles,   1994).     Biased  instruments  and  sampling 
could  be  managed  by  use  of  standardized  instruments  within  a 
particular  market  and  use  of  independent  third  party  survey 
administration  (Fowles,   1994).     Like  "upcoding" , 
"upreporting"  due  to  coaching  might  be  managed  by  periodic 
recalibration  of  risk  weights. 
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within  data  sources,   ease  of  gaming  and  auditing  are 
directly  related  to  the  complexity  of  the  measure;  the 
simpler  the  measure  the  easier  it  is  to  game  and  to  audit. 
For  example,   in  order  to  game  or  audit  a  claims -based  model 
that  uses  a  complex  grouping  algorithm  (e.g.  ACGs/ADGs) ,  it 
is  necessary  to  understand  the  grouping  rules  both  to  game 
and  audit  the  system.     The  more  complex  the  measure,  the 
more  complex  the  audit  trail.     Simple  measures  such  as  a 
limited  list  of  20-30  claims-based  or  self -reported 
conditions  can  be  relatively  easily  audited  by  referring  to 
an  alternate  source  of  data  such  as  the  medical  record 
(Fowles,   1994) .     More  complex  and  subjective  self -report 
measures  such  as  the  SF-36  have  a  disadvantage  in  that  they 
are  difficult  to  audit   (Fowles,   1994) . 

Incentives  for  cost-effective  care  are  directly  related 
to  group  predictive  accuracy,  particularly  to  groups  biased 
with  respect  to  risk  and/or  characteristics  concerning 
social  equity   (e.g.  race,   ethnicity,   income  level  etc.)  .  To 
the  extent  that  predictive  accuracy  is  consistently  high  for 
a  broad  range  of  systematically  biased  subgroups,  the  risk 
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model  minimizes  incentives  to  undertreat  high  risk 
enrollees .     Formulation  of  an  appropriate  conceptual 
framework  for  applied  risk  models  discussed  above  eliminates 
many  types  of    predictors  that  would  provide  disincentives 
for  cost-effective  care   (e.g.   specialist  mix,  hospital 
beds/1000) . 

Prior  use  measures  are  typically  excluded  from  risk 
models  because  they  could  provide  disincentives  for  cost 
effective  care.     As  noted  above,  perhaps  it  would  be 
possible  to  incorporate  simple  prior  use  measures  (e.g. 
level  of  prior  hospital  use  such  as  1  admission  and  2  or 
more  admissions) .     Risk  adjustment  models  cannot  address  all 
concerns  related  to  incentives  and  should  be  augmented  by 
utilization  review,  practice  profiling  and  quality  assurance 
methods   (Hillman,   1993) . 

Findings  from  recent  capitation  risk  modeling  research 
concerning  predictive  accuracy,   administrative  feasibility, 
ease  of  gaming/auditing  and  incentives  for  cost  effective 
care  indicate  that  a  risk  assessment  model  comprised  of  a 
simple  list  of  self -reported  conditions  performs  as  well  as 
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more  complex  claims-based  models  such  as  those  derived  from 
ACGs.     Although  certain  advantages  and  disadvantages  related 
to  the  four  criteria  vary  somewhat  across  models,  as 
discussed  above,  disease  specific  models   (e.g.  self -reported 
chronic  conditions,  California  Marker  Diagnosis,  HCC,  DPS) 
appear  to  be  emerging  as  the  most  effective  types  of  risk 
adjustment  models.     As  indicated  above,  although  there  are 
two  self -report  models  containing  lists  of  conditions,  no 
claims-based  models  comprised  of  a  simple  list  of  conditions 
has  been  developed  and  evaluated  for  the  Medicare  aged.  The 
California  Marker  Diagnosis  model  for  the  small  group  and 
individual  market  is  closest  to  a  simple  list  of  claims- 
based  conditions  followed  by  the  HCC  model  for  the  Medicare 
aged  and  disabled,  and  the  DPS  model  for  the  Medicaid 
disabled. 
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Gap  in  Research  Addressed  by  this  Project 

This  research  develops  and  evaluates  a  simple/direct 
claims  model  comprised  of  a  list  of  3  0  specific  conditions 
from  diagnoses  listed  on  ambulatory  and  inpatient  claims 
data.     This  is  the  first  model  in  the  simple/direct  claims 
family  for  the  Medicare  aged  population. 

Just  as  there  is  a  bias  against  Prior  Use  risk  models 
because  they  could  provide  disincentives  for  cost  effective 
care,     there  has  been  a  bias  against  specific  condition 
models  because  their  simplicity  may  make  them  easier  to  game 
or  manipulate  than  more  complex  models    (GAO  1994)  .   In  the 
past  five  years  techniques  for  practice  profiling  and 
utilization  review  have  advanced  to  the  point  where  they 
could  be  quite  useful  in  auditing  specific  condition  risk 
model  gaming  practices  among  providers    (Lawthers,   1994) . 
Moreover,  gaming  due  to  upcoding  can  be  controlled  by 
recalibrating  models  on  an  annual  basis    (Ellis,  1996). 
Concerns  over  gaming  and  the  fact  that  these  techniques  for 
auditing  simple  specific  condition  models  are  not  available 
in  earlier  years  may  explain  why  no  simple  clinical  models 
have  been  developed  for  Medicare  until  now. 
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The  next  chapter  describes  methods  used  in  this  project 
to  develop  and  validate  two  specific  condition  models  for 
the  Medicare  aged  population. 
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CHAPTER  IV:  RESEARCH  METHODS 

OVERVIEW 

A  detailed  description  of  data  sources,  sample 
design,  dependent  and  independent  variables,  model 
evaluation  criteria,   statistical  analyses  and  data 
processing  environment  for  this  research  is  presented 
following  this  overview. 

Briefly,   a  split  random  sample  approach  using 
administrative  data  from  the  standard  Medicare  5%  sample 
is  employed  to  develop  and  validate  two  specific 
condition  risk  adjustment  models    (SPEC  I  and  SPEC  II)  for 
the  Medicare  aged  population. 

Stepwise  and  ordinary  least  squares  linear 
regression  are  used  to  select  specific  conditions  and 
interaction  terms  for  a  linear  risk  adjustment  model. 

Generalizability  of  the  resulting  model  is  tested 
using  split  samples.     Results  of  model  parameter 
estimates   (estimated  expenditure  per  condition)  and 
individual  predictive  accuracy  (adjusted  R-Square)  are 
calculated  for  both  samples  and  their  comparability 
assessed. 
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Group  predictive  accuracy  is  assessed  with  - 
predictive  ratios .     As  discussed  above  in  the  methods 
literature  review,   the  predictive  ratio  is  adopted  in 
this  dissertation  research  because  it  is  the  most  widely 
used  measure,    is  common  to  recent  research  most  relevant 
to  this  project   (Ellis,   1996;  Weiner,   1996)   and  easy  to 
calculate  and  to  interpret.     Ratios  are  calculated  in  the 
aggregate  for  total  expenditures,   two  reinsurance  levels 
($100k  and  $50k) .     Ratios  are  also  calculated  for  non- 
randomly  biased  groups  based  on  high  or  low  prior  year 
cost,  prior  hospital  use  and  presence  of  selected 
conditions  in  the  prior  year.     The  formula  for  the 
predictive  ratio  is  mean  predicted  value/mean  actual 
value  for  the  group   (Ash  1989) . 

Five  models    (an  AAPCC  proxy,   two  specific  condition 
models,   the  ADG/HOSDOM  and  ADG/MDC  models)   are  compared 
using  four  evaluation  criteria   (predictive  accuracy, 
administrative  feasibility,   incentives  for  cost 
effectiveness  and  gameability)   to  determine  whether 
specific  condition  models  are  viable  alternatives  to  the 
current  AAPCC  risk  adjustment  model.  The  standard  applied 
here  for  determining  whether  specific  condition  models 
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are  viable  alternatives  to  the  AAPCC  is  that  they  must 
outperform  the  AAPCC  and  be  at  least  comparable  to  the 
ADG/HOSDOM  and  ADG/MDC  models  for  these  four  evaluation 
criteria . 

DATA  SOURCES 

All  data  used  in  this  study  is  extracted  from  1991 
and  1992  HCFA  Medicare  enrollment  and  claims  data  files. 
Data  is  extracted  from  HCFA' s  Standard  Analytic  Claims 
File  for  Part  A  and  Part  B  claims.     Claims  for  inpatient, 
outpatient  facility,   skilled  nursing  facility,  TEFRA 
facilities,  physician  and  other  medical  services,  home 
health  and  hospice  care  are  included.     Enrollment  data  is 
extracted  from  the  Medicare  Common  File  and  the  HISKEW 
file.     File  descriptions  are  included  in  the  appendices. 

ANALYSIS  FILE  CONSTRUCTION 

Project  teams  at  Johns  Hopkins  University   (JHU)  in 
Baltimore  and  Health  Economics  Research  Inc.    (HERI)  in 
Boston  developed  several  analytic  files  that  were  shared 
between  their  projects  and  with  this  research  effort 
(Ellis,   1996;  Weiner,   1996) .     HERI  extracted  data  from 
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the  common  file  and  created  the  development  and  - 
validation  samples  used  by  HERI,  JHU  and  this  project. 
HERI  and  JHU  created  all  dependent  and  independent 
analytic  variables  employed  in  the  AAPCC  proxy  (basic 
socio-demographic) ,  ADG/HOSDOM  and  ADG/MDC  models  (Weiner 
1996) .     This  research  leverages  the  sampling  design  and 
analytic  files  from  the  HERI  and  JHU  projects  for 
purposes  of  developing  and  validating  the  effectiveness 
of    a  different  family  of  models,   specific  condition  risk 
ad j  us  tmen t  mode 1 s . 

All  independent  variables  in  the  specific  condition 
models,   described  in  detail  below,  were  developed  in  the 
course  of  this  dissertation  research.     The  specific 
condition  variables  derived  here  are  comprised  of  ICD-9- 
CM  diagnosis  codes  extracted  from  the  same  claims  files 
used  by  JHU  and  HERI  to  develop  independent  variables  for 
their  respective  models.     An  ICD-9-CM  code  validation 
table  developed  at  JHU  to  edit  ICD-9-CM  codes  for  the  JHU 
project  was  used  to  edit  ICD-9-CM  codes  extracted  by  this 
project. 

There  is  a  methodological  difference  between  the 
ICD-9-CM  code  claims  extraction  performed  in  this  and  the 
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JHU  project.     This  project  includes  all    valid  ambulatory 
ICD-9-CM  codes  and  inpatient  diagnosis  codes  including 
discharge  and  admission  diagnoses.     The  JHU  project 
includes  diagnosis  codes  from  ambulatory  and  inpatient 
claim  discharge  diagnoses  but  specifically  excludes 
inpatient  admission  diagnoses  because  the  project  team 
believed  inpatient  discharge  diagnoses  are  more  accurate 
than  admission  diagnoses  and  could,  therefore,   lead  to 
inaccurate  risk  classification   (Weiner  1995,   1996) . 
Laboratory  claim  diagnoses  are  also  excluded  from  ADG 
assignment  in  the  JHU  project  out  of  concern  that 
categorizing  patients  based  on  lab  tests  could  include 
rule -out  diagnoses  and  make  gaming  the  system  much  easier 
(Weiner  1995,    1996) . 

Problems  of  inaccurate  classification  and  increased 
opportunity  for  gaming  are  important  issues  especially 
for  the  family  of  complex/indirect  claims-based  models 
with  grouping  algorithms  that  combine  broad  ranges  of 
diagnoses  in  one  risk  class.     The  importance  of  these 
issues  decrease  in  simple/direct  claims-based  models 
because  risk  classes  are  less  broad,  easier  to  audit  and. 
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like  problems  with  upcoding,   can  be  mitigated  by 
periodically  recalibrating  the  model. 

It  is  possible  that  increased  rule-out  diagnoses  are 
indicative  of  more  complex  health  problems  that  require 
more  extensive  use  of  health  services  for  diagnosis  and 
treatment.     Rule-out  diagnoses  may  also  indicate  a 
greater  propensity  to  use  the  health  care  system  or 
increased  costs  of  "defensive  medicine" .     It  is  also 
possible  that  although  inpatient  admission  diagnoses  may 
not  be  as  accurate  as  discharge  diagnoses  in  representing 
the  true  morbidity  status  of  a  patient,   they  may  reflect 
the  initial  course  of  inpatient  treatment  and  their 
omission  may  result  in  underestimating  hospital  costs. 
These  last  two  issues  may  be  more  applicable  to  and 
important  considerations  for  simple/direct  rather  than 
complex/indirect  claims-based  models. 

The  choice  to  include  admission  and  lab  diagnoses 
here  is  based  on  the  concept  that  all  diagnoses  reflected 
in  claims  are  related  to  treatment  or  services  received 
and,  therefore,  to  charges  and  payments.  Thus,  this 
project  casts  a  broader  net  when  including  ICD-9-CM  codes 
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to  define  specific  conditions  than  the  JHU  project  does 
for  ADG  assignment. 

SAMPLE  DESIGN 

The  standard  Medicare  5%  sample   (N=l,372,047  for 
1991)   formed  the  initial  sampling  frame  for  this 
research.     The  standard  5%  sample  comprises  aged, 
disabled  and  ESRD  Medicare  beneficiaries  who  have  a 
social  security  number  where  the  last  two  digits  are 
equal  to  any  1  of  5  two  digit  numbers  that  define  the 
standard  sample.     This  sampling  process  was  initiated  by 
the  HCFA  to  establish  a  standard  sample  of  data  for  use 
in  research  for  program  management,  evaluation  and 
development . 

This  study  includes  persons  from  the  standard  5% 
sample  age  65  or  over,   continuously  enrolled  in  both 
Parts  A  and  B  of  Medicare  during  1991  and  alive  at  year 
end  1991.     Persons  with  End  Stage  Renal  Disease  (ESRD), 
the  disabled  under  age  65  and  those  enrolled  in  an  HMO 
are  excluded  as  are  Railroad  Board  or  Indian  Health 
Seirvice  beneficiaries . 
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Disabled  persons  under  65  or  persons  with  ESRD  are 
excluded  because  the  nature  of  their  conditions  and  their 
cost  profiles  differ  markedly  for  the  majority  of 
Medicare  enrollees,   the  aged  (Weiner,   1996)  .     Persons  who 
did  not  have  both  part  A  and  B  coverage  or  who  are  in 
HMOs  or  who  are  only  partially  eligible  for  coverage 
during  1991  due  to  death  or  attaining  eligibility  age  are 
excluded  because  base  year  data  for  calibrating  the  risk 
model  is  incomplete.     Inclusion  of  these  subpopulations 
in  the  study  would  bias  adjustment  weights  for  each  of 
the  reasons  cited.     Beneficiaries  meeting  these  study 
criteria  are  retained  from  the  split  random  samples 
developed  by  HERI  yielding  a  development  sample  size  of 
618,418  and  validation  sample  size  of  618,  772.  These 
exclusions  are  made  after  HERI  randomly  allocated  the 
full  5%  sample  to  two  split  samples.     Therefore,  maximum 
likelihood  chi  square  and  analysis  of  variance  F  tests  of 
sample  homogeneity  are  performed  on  dependent  and 
independent  variables  for  all  models  to  determine  whether 
bias  is  introduced  to  the  samples  as  a  result  of  this 
process  or  by  chance  during  random  allocation. 
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Split  random  sample  sizes  in  this  study  (618,418  and 
618,772)  yield  statistical  power  for  the  regression  model 
F  test  from  .86-. 91     given  alpha  <   .01,  a  model  with  30- 
40  independent  variables  and        =   .0725   (Cohen  J,  1988). 
Statistical  power  of  this  magnitude  means  that  the  null 
hypothesis  of  no  linear  relationship  between  the 
independent  variables  and  the  dependent  variable  may  be 
rejected  with  a  relatively  low  chance  of  committing 
either  Type  I  error   (rejecting  a  true  null)   or  Type  II 
error     (failing  to  reject  an  hypothesis  when  it  is 
actually  false)    (Blalock,   1972;  Cohen,  1988) . 

Split  samples  are  used  to  test  stability  and 
generalizability  of  the  resulting  risk  mode.  Stability 
is  tested  by  comparing  parameter  estimates  for  each 
sample.       Generalizability  is  assessed  by  applying  beta 
weights  calibrated  using  the  development  sample  to 
predict  the  validation  sample  and  calculating  predictive 
ratios.     If  predictive  accuracy  is  high  (within  5%  of 
1.0)   then  the  model  has  acceptable  external  validity. 
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DEPENDENT  VARIABLES 

Three  dependent  variables  are  derived  from  claims 
data.  An  adjusted  total  Medicare  payment  for  1992  is  the 
primary  dependent  variable.     The  second  variable, 
representing  a  $100,000  reinsurance  level,   is  the 
adjusted  1992  total  payment  truncated  at  $100,000  (i.e., 
all  cases  over  100,000  in  payments  are  set  to  100,000)  . 
The  third  variable,  representing  a  $50,000  reinsurance 
level,   is  an  adjusted  1992  total  payment  truncated  at 
$50, 000 . 

At  the  time  of  this  research  HCFA  is  examining  what 
reinsurance  levels   (e.g.   $100,000  or  $50,000)  might  be 
most  appropriate  if  HCFA  were  to  offer  a  reinsurance  pool 
to  participating  plans   (HCFA  1994)  .     Therefore  these 
levels  are  chosen  for  this  research.       A  $50,000  level  is 
offered  to  plans  selected  for  participation  in  the 
"Choices"  Demonstration  Project   (HCFA  Choices  RFP,   1995)  . 

The  specific  method  for  calculating  the  primary 
dependent  variable,  total  adjusted  1992  expenditures, 
used  in  this  and  the  JHU  and  HERI  studies  is  determined 
jointly  by  HCFA-ORD,  JHU  and  HERI  for  their  projects. 
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Actual  reimbursement  amounts  are  used  for  outpatient 
facilities,   skilled  nursing  facilities,  TEFRA. 
institutions  exempt  from  prospective  DRG  payments, 
hospice  and  home  health  services,   and  physician  services 
where  no  RBRVS  amounts  are  assigned.     Where  RBRVS  units 
are  assigned  the  dependent  variable  includes  RBRVS 
amounts  adjusted  by  the  Geographic  Physician  Service 
Indicator.     Inpatient  expenditures  are  derived  from  DRG 
payments  plus  outlier  and  capital  payments .  The 
dependent  variable  reflects  the  sum  of  all  the  above  for 
each  included  beneficiary  for  each  study  year  with  all 
expenditures  inflation  adjusted  to  1992  dollars.  Patient 
co-payments,   deductibles  and  out-of-pocket  costs  for 
services  not  covered  by  Medicare  are  excluded  from  the 
dependent  variable.    (Weiner,  1996). 

The  choices  made  by  HCFA,  JHU  and  HERI  on  how  to 
calculate  the  dependent  variable  are  the  most  appropriate 
choices  for  risk  modeling  of  Medicare  capitation  payments 
because  they  are  the  best  possible  reflection  of  costs 
incurred  by  the  Medicare  program.  Medicare 
reimbursement    (payments)  not  charges  or  costs  are 
modeled.     Payments  are  calculated  using  Medicare  specific 
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payment  policy  amounts   (e.g.  maximum  allowable  fees, 
RBRVS  fees  for  providers  and  DRG  prospective  payments)  . 
This  method  of  calculation  is  more  relevant  for  Medicare 
program  management,   capitation  rate  setting  and  risk 
adjustment  than  alternate  measures  of  health  care  costs 
such  as  provider  charges  or  estimation  of  total  social 
cost  of  care  for  the  aged  because  it  reflects  the  program 
costs  for  benefits  and  services  covered  which  must  be 
included  in  Medicare's  base  capitation  rate. 

Adjusting  the  dependent  variable  for  month  of  death 
standardizes  the  impact  of  mortality  on  adjustment 
weights  calibrated  in  the  risk  models   (Ash,   1989;  Ellis, 
1996;  Weiner,   1996) .     First,   the  total  payment  for 
persons  who  die  in  1992  is  inflated  using  the  formula 
((TOTAL  PAYMENT /MONTHS  ALIVE)    *  12)  .     That  observation  is 
weighted  downward  by  proportion  of  the  year  alive  (Months 
Enrolled/12)  when  adjustment  weights  are  calibrated. 
This  process  allows  costs  that  accrue  near  death  to  be 
reflected  in  the  adjustment  weights  while  reducing  the 
bias  that  could  occur  if  these  cases  are  not  standardized 
in  some  fashion. 
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INDEPENDENT  VARIABLES 

The  five  risk  adjustment  models  examined  here  are  an 
AAPCC  proxy,    two  ACG/PACS  derivative  models  (ADG/HOSDOM 
and  ADG/MDC)   and  two  models  with  comprised  of  30  specific 
conditions   (SPEC  I  without  and  SPEC  II  with  dummy 
variables  for  number  of  inpatient  stays) .  Three 
categories  of  independent  variables  apply  to    five  models 
compared  here;   socio -demographic  variables,  ACG/PACS 
derived  variables,  and  specific  condition  variables. 

Specific  condition  risk  adjustment  variables  are 
presented  first.     Specific  condition  operational 
definitions,   development  of  specific  condition  risk 
models  and  assessing  the  viability  of  those  models  for 
Medicare  risk  adjustment  are  original  contributions  made 
by  this  study  to  health  services  research.     A  discussion 
of  other  variables     comprising  the  comparison  models 
follows . 

Specific  condition  variables  are  comprised  of  ICD-9- 
CM  diagnosis  codes  present  in  hospital  inpatient, 
outpatient,   physician  services,  hospice,  home  health, 
skilled  nursing  and  TEFRA  facility  claims.     A  detailed 


118 


discussion  of  the  process  and  criteria  for  defining  these 
predictors  follows  the  7  steps  outlined  below. 

1)  Accept  the  level  of  clinical  cogency  present  in 
the  ICD-9-CM  nosology. 

2)  Choose  a  level  in  the  ICD-9-CM  nosology  that 
strikes  a  balance  between  the  need  for  model 
parsimony  and  for  sufficient  prevalence  of  those 
conditions  to  ensure  statistical  reliability  and 
validity  of  the  estimates   (i.e.  the  second  highest 
aggregation  level  in  the  ''List  of  Three-Digit 
Categories" ) . 

3)  Regress  Year  2  expenditures  on  Year  1  conditions. 

4)  Retain  conditions  with  positive  estimates, 
significant  at  .001  with  costs  greater  than  $250. 

5)  Perform  an  empirical  review  of  remaining 
predictors  to  determine 

which  could  be  collapsed  given  similar  cost 
levels  and  clinical  cogency  as  implied  by  the 
ICD-9-CM  classification  structure  and 

which  could  be  disaggregated  based  on  high 
cost,   sample  size  and  distribution  of  5 -digit 
ICD-9-CM  codes  within  the  initial  operational 
definitions  of  predictors. 

6)  Perform  a  stepwise  regression  with  all  predictors 
defined  using  the  largest  increase  in  adjusted  R2  as 
the  variable  selection  criteria,  requesting  the  best 
35  condition  model. 

7)  Review  results  keeping  four  sociodemographic 
variables  and  30  conditions  that  meet  the  following 
criteria:  positive  parameter  estimates,   cost  >  $250, 
statistically  significant  T  test  <  .001, 

policy  relevance,   clinical  cogency  and  proxies 
for  severity,   frailty  and/or  limitations  in 
activities  of  daily  living. 
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The  process  begins  by  simply  accepting  the  I-CD-9-CM 
nosology  as  defined.  The  International  Classification  of 
Diseases,  version  9,  with  clinical  modifications  (ICD-9- 
CM)   is  the  standard  diagnostic  nosology  employed 
nationally  for  diagnostic  coding  of  claims  and  medical 
records.     The  ICD  classification  was  originally  piiblished 
in  1948  by  the  World  Health  Organization  for  tracking  and 
comparing  morbidity  and  mortality  across  nations.  The 
National  Center  for  Health  Statistics  developed  clinical 
modifications  to  the  9th  edition  of  the  ICD  published  in 
1977.     These  modifications  delineate  patient  clinical 
status  enabling  use  of  ICD- 9  codes  for  improved  morbidity 
classification  and  medical  care  reviews.     In  1988 
Congress  mandated  use  of  ICD-9-CM  codes  for  all  Medicare 
claims  effective  April  1,   1989.     This  mandate  enhanced, 
but  did  not  ensure,   completeness  of  diagnostic  coding  on 
Medicare  claims   (1991  ICD-9-CM  Coding  Manual) . 

Existing  and  emergent  medical  care  review  processes, 
case  mix  measures  and  risk  adjustment  models  increasingly 
rely  on  ICD-9-CM  diagnosis  coding  in  administrative  data 
sources  because  it  is  relatively  cost  effective,  data 
quality  has  been  improving  over  time,  and  the  ICD- 9 -CM 
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nosology  has  an  inherent  level  of  clinical  face  validity 
(Ellis,   1996;   Palmer,   1994;  Weiner,   1996;  ICD-9-CM 
Manual,   1991).     Even  though  use  of  diagnoses  from  claims 
data  for  various  purposes  has  been  criticized  for 
incompleteness,   inaccuracies  etc.    (Anderson,  1990; 
Connel,    1987;  Ellis,    1996;  Fowles,   1994;  Hsai ,  1988; 
Lave,   1994;  Mithcell,   1994;  Romano,   1992;  Weiner,  1990 
1996) ,   expanding  use  of  claim  diagnosis  codes  in  multiple 
applications  increases  confidence  in  the  reliability  and 
completeness  of  this  data. 

ICD-9-CM  diagnosis  codes  recorded  on  paid  Medicare 
claims  of  all  types  are  used  to  indicate  presence  or 
absence  of  specific  conditions  in  this  study.  For 
hospital  and  facility-based  outpatient  claims  in  the  UB82 
claim  format,  all  valid  admission  diagnoses  and  first  10 
discharge  diagnoses  listed  on  claims  are  used  in 
constructing  specific  condition  variables.     For  physician 
claims  submitted  on  the  HCFA  1500  format,   all  valid 
header  and  line  item  diagnoses  on  the  claim  are  also  used 
in  constructing  specific  condition  variables.     In  1988, 
Congress  mandated  that  by  198  9,  physicians  must  enter  at 
least  one  diagnosis  in  the  top   (header)  portion  of  the 


121 


claim  indicating  the  primary  reason  for  the  visit-. 
Congress  also  mandated  that  each  procedure  submitted  for 
payment  in  the  claim  detail  line  item  area  must  have  an 
associated  diagnosis.     If  a  beneficiary  has  a  least  1 
valid  diagnosis  code  on  any  1991  UB82  facility  or  HCFA 
1500  medical  service  claim  within  the  range  of  codes 
defining  a  specific  condition,   then  that  condition  is 
coded  as  present  for  that  beneficiary. 

The  second  step  in  the  process  entails  choosing  a 
level  in  the  ICD-9-CM  hierarchy  of  conditions  that 
strikes  a  balance  between  the  need  for  model  parsimony 
and  for  sufficient  prevalence  of  those  conditions  to 
ensure  statistical  reliability  and  validity  of  the 
estimates .     The  111  disease  groups  at  the  second  highest 
level  of  aggregation  within  the  list  of  three-digit 
categories  are  much  more  specific  than  the  16  highest 
level  diagnostic  categories  and  much  more  parsimonious 
than  the  full  list  of  3 -digit  codes  or  any  lower  level 
(e.g.  4  or  5  digits)    in  the  code  list.     Starting  with  a 
predefined  set  of  111  disease  groups  designed  for  a 
variety  of  clinical,   health  research  and  billing  purposes 
is  a  logical  and  efficient  first  step  in  use  of  diagnosis 
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codes  for  risk  assessment.     The  list  of  111  categories  is 
then  examined  and  selected  groupings  eliminated  or 
collapsed  depending  on  relevance  to  the  Medicare  aged 
population,   concern  over  prevalence,   interest  in  effects 
of  iatrogenic  illness  and  interest  in  a  severity  of 
illness  proxy  measure  for  cancer  diagnoses .  This 
process,   described  in  detail  below,  resulted  in  reducing 
111  categories  to  an  initial  list  of  98  conditions. 

More  specifically,   3  disease  categories  related  to 
"Complications  of  Pregnancy",   1CD-9-CM.  code  range  760- 
77999,  are  eliminated  from  the  original  list  of  111. 

Other  second  level  condition  groupings  are  collapsed 
at  the  outset  due  to  concerns  over  potentially  low 
prevalence.     Twenty- four  "Injury  and  Poisoning" 
categories  are  collapsed  into  two  variables:  one  for 
fractures    (800-82999) ,   and  one  for  all  other  injury  and 
poisoning  codes    (830-99599)  .     Complications  of  medical 
and  surgical  care  not  elsewhere  classified  (996-99999) 
are  collapsed  with  three  categories  of  E  codes  (E870-E876 
"Misadventures  to  patients  during  surgical  and  medical 
care",  E878-E879  "Surgical  and  medical  procedures  as  the 
cause  of  abnormal  reaction  ...   or  later  complication"  and 
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E93  0-E94  9  "Drugs,  medicinal  and  biological  substances 
causing  adverse  effects  in  therapeutic  use")     in  order  to 
examine  the  effect  of  iatrogenic  illness  on  subsequent 
year  costs  and  utilization.     All  other  E  codes  are 
collapsed  into  one  variable.     All  V  codes  are  collapsed 
into  four  variables   (Personal  history  of  Cancer,  Mental 
Health  Conditions,   any  transplant  and  all  other  V  codes) 
due  to  concern  for  prevalence  of  and  an  interest  in 
examining  history  variables  for  those  three  typically 
high  cost  conditions. 

In  addition  to  three-digit  categories  representing 
neoplasms  by  body  system,   four  composite  neoplasm 
variables  are  constructed  without  regard  to  site  as 
proxies  for  severity  or  stage  of  cancer.     Those  composite 
variables  are  primary  malignancy,   secondary  malignancy, 
an  interaction  term  for  primary  and  secondary 
malignancies  and  benign  neoplasms.     Presence  of  a 
secondary  malignancy  or  interaction  with  a  primary 
malignancy  suggests  later  stage  or  high  severity  cancers. 
Benign  neoplasms  reflect  the  lowest  severity  of  neoplasm. 
The  primary  malignancy  variable  includes  malignant 
melanoma  of  skin   (172)  but  excludes  other  malignant 
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neoplasm  of  skin   (173)   such  as  basal  and  squamous  cell 
malignancies  which  have  a  high  prevalence  and  low  cost 
relative  to  other  cancers.     Prior  experience  with  cancer 
research  guided  these  decisions   (Campbell  1994) . 

Regressing  1992  costs  on  condition  variables  defined 
as  described  above  initiates  the  third  step.  The 
resulting  regression  is  shown  in  the  appendix. 

The  fourth  step  entails  retaining  all  conditions 
with    positive  estimates,     significant  at  the  0.001  level 
with  costs   (i.e.  effect  size)   greater  than  $250.     The  p 
<=  0.001  level,  a  very  conservative  rejection  level  for 
the  samples  sizes  in  this  study,   is  used  to  minimize  Type 
I  error  for  the  parameter  T  test   (e.g.,  rejecting  a  true 
null  hypothesis  that  a  specific  condition  has  no  effect 
on  subsequent  year  costs) .     Such  a  conservative  choice 
maximizes  the  likelihood  that  specific  conditions 
retained  in  the  model  are  indeed  predictors  of  subsequent 
year  costs  while  increasing  the  risk  of  Type  II  error 
(failure  to  reject  a  false  null) .     However,  increasing 
Type  II  error  is  not  critical  since  effects  of  omitted 
conditions  are  absorbed  in  the  intercept  term     (i.e.  the 
average  or  base  capitation  rate  for  the  population) . 
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Because  the  primary  objective  of  a  capitation  ris-k 
assessment  model  is  predictive  accuracy  at  the  group 
level,  especially  for  groups  biased  with  respect  to 
factors  that  could  be  known  to  health  plans  or  patients 
(e.g.  specific  morbidities,  impending  high  use)  that 
might  induce  selection  bias,  it  is  most  important  to 
isolate  the  strongest  predictive  factors  particularly 
since  the  effects  of  weaker  predictors  omitted  due  to 
Type  II  error  are  not  lost  but  rather  reflected  in  the 
population  mean  or  intercept. 

Four  interrelated  parameters  effect  the  statistical 
validity  of  a  statistical  test : 

1)  the  power  of  the  test,    1-P,  where  P  is  the  level 
of  Type  II  error   (e.g.  P=.20), 

2)  region  of  rejection  or  a  level    (e.g.  a=p=.001), 

3)  the  sample  size  n   (higher  n  increase  power) 

4)  effect  size  such  as  parmeter  estimates  of  risk 
factors   (larger  effects  increase  Power)  (Cohen, 
1983)  . 

"Mathematically,  any  one  of  these  is  determined  by 
the  other  three"   (Cohen,   1993) .     In  the  sample  discussion 
above  we  noted  statistical  power  for  the  full  regression 
model  F  test  ranges  from  .86 -.91    given  alpha  <   .01,  a 
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model  with  30-40  independent  variables  and  effect  size  or 

=   .0725    (Cohen  J,   1988).       Statistical  power  of  this 
magnitude  means  that  the  null  hypothesis  of  no  linear 
relationship  between  all  independent  variables   (the  full 
model)   and  the  dependent  variable  may  be  rejected  with  a 
relatively  low  chance  of  committing  either  Type  I  (alpha 
<   .01)   error   (rejecting  a  true  null)   or  Type  II   (beta  < 
.14)   error     (failing  to  reject  an  hypothesis  when  it  is 
actually  false)    (Blalock,  1972;  Cohen,  1988) . 

Although  including  predictors  with  negative 
parameter  estimates  significant  at  p  <  .001  would 
increase  the  predictive  accuracy  of  a  risk  model  at  the 
lower  end  of  the  distribution  by  decreasing  the 
population  mean  for  less  costly  risk  factors,   it  would 
also  create  financial  disincentives  for  cost  effective 
care  and  increase  incentives  for  upcoding  to  avoid 
reduction  of  base  capitation  rates.     Therefore,  negative 
predictors  are  omitted  from  the  model  for  policy  reasons. 

Fifty  eight  predictors  attained  positive  estimates 
statistically  significant  at  p=0.001. 

At  this  point,   a  preliminary  test  of  specific 
condition  model  face  validity  was  conducted  using  30  of 
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the  59  variables  with  the  highest  parameter  estimates 
retained  in  step  4 .     Results  were  reviewed  by  two 
physicians.  Dr.  Neil  Powe  and  Dr.  Albert  Wu,   to  assess 
the  face  validity  of  that  model.     While  neither  physician 
endorsed  the  model  as  having  perfect  clinical  face 
validity,  both  felt  the  conditions  and  parameter 
estimates  had  face  validity  at  least  equivalent  to  other 
models  and  suggested  two  ways  clinical  validity  might  be 
improved;  1)  reexamining  composition  of  each  variable  to 
ensure  they  it  is  clinically  precise  and  credible 
combination  of  ICD-9-CM  codes  2)   employing  stepwise 
regression  to  determine  which  conditions  make  the 
greatest  contribution  to  R2 .     These  suggestions  were 
incorporated  in  steps  five  and  six  of  the  modeling 
process . 

Of  the  59  variables  retained  from  the  regression  in 
Step  3,   7  had  high  multicolinearity  as  indicated  by 
variance  inflation  factors  greater  than  4.00    (Fox  1991). 
The  formula  for  the  variance  inflation  factor  is  1/(1- 
R2j)   where  R2j    is  the  proportion  of  variance  of  any  given 
IV  explained  when  regressed  on  all  other  IVs  in  the 
model.     The  numerator  1-R2   (Tolerence)j  reflects  the 
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amount  of  variance  in  each  IV  unique  to  that  IV  that 
could  explain  variance  in  the  dependent  variable  in  our 
risk  model,   health  expenditures.     If  that  variance 
remaining  is  less  than  .25    (1-.75)   then  the  variance 
inflation  factor  will  be  1/.25  or  4.0,   the  cut-off  point 
for  rejecting  IVs  because  there  is  too  much  colliniarity 
with  other  variables . 

All  variables  above  the  VIF  limit  4 . 0  were  expected 
given  inclusion  of  composite  variables  for  malignant 
neoplasms,  psychological  conditions  and  conditions 
related  to  diseases  of  the  musculoskeletal  system  and 
connective  tissue.     All  three  composite  variables  are 
retained,   including  malignant  neoplasms  which  failed 
other  selection  criteria,   as  a  means  of  modeling  average 
costs  for  these  composite  condition  groups  and  allowing 
both  clinically  and  statistically  significant  higher  cost 
conditions  within  the  composite  to  emerge  through  the 
modeling  process.     Fifty  nine  variables  are  retained  for 
consideration  in  the  next  step  of  the  process. 

The  fifth  step  of  the  process  entails  an  empirical 
review  of  remaining  predictors  to  determine  which  could 
be  collapsed  given  similar  cost  levels  and  clinical 
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cogency  as  implied  by  the  ICD-9-CM  classification 
structure  and  which  could  be  disaggregated  based  on  high 
cost,   sample  size  and  distribution  of  5-digit  ICD-9-Ciyi 
codes  within  the  initial  operational  definitions  of 
predictors . 

Of  the  59  variables  retained,   three  parameter 
estimates  with  high  costs   (i.e.  >  $1500)   that  are  not 
homogenous  with  respect  to  the  types  of  sub- conditions 
contained  in  their  group  are  further  investigated  to 
determine  if  more  specific  condition  definitions  are 
appropriate.     Those  conditions  are  "other  bacterial 
disease"    (030-04199) ,    "all  nutritional  deficiencies" 
(260-26999)   and  "all  external  agent  lung  disease"  (50- 
50899)  .     Frequency  distributions  of  5  digit  ICD-9-CM 
codes  within  these  three  categories  are  examined  to 
determine  whether  clusters  of  more  specific  diseases 
account  for  high  costs.     Within  "other  bacterial  disease" 
two  conditions  emerged;   septicemia   (038-03899)  and 
bacterial  infections  in  conditions  classified  elsewhere 
(041-04199) .     Three  additional  dummy  variables  are 
created  based  on  this  finding;  septicemia  only,  bacterial 
infections  in  conditions  classified  elsewhere  and  both 
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combined  excluding  all  other  types  of  bacterial  ' 
infections.     Within  all  nutritional  deficiencies, 
protein-calorie  malnutrition   (263-26399)  emerged. 
Within  all  external  agent  lung  disease,  disease  due  to 
solids  and  liquids   (507-50799)   is  the  most  prevalent. 

Knowledge  of  concern  among  Medicare  peer  review 
organizations   (PRO)    (gained  while  employed  as  a 
consultant  at  one  PRO)   over  the  prevalence  of  decubitus 
ulcers   (bed  sores)   led  to  development  of  a  separate 
variable  for  that  condition  in  recognition  of  its 
potential  as  an  indicator  of  overall  severity  of  illness 
since  bed  sores  typically  occur  only  in  patients  who  are 
so  frail  they  are  bed- ridden.     Interest  in  examining 
whether  different  types  of  heart  conditions  had  different 
effects  on  costs  motivated  creation  of  five  heart 
condition  variables;  Any  heart  condition,   CHF  only,  other 
heart  disease  excluding  CHF,   Ischemic  heart  disease  and 
pulmonary  circulation  disease  combined,  and  chronic 
rheumatic  heart  disease  combined  with  other  heart 
disease . 

Table  4.1  contains  operational  definitions  for  all 
specific  condition  variables  defined  by  this  process. 
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Dummy  predictor  variables    (O=condition  not  present, 
l=condition  present)   are  created  for  the  initial  list  of 
conditions  described  above   (e.g.   Intestinal  and 
Infectious  Diseases  001-009,  Tuberculosis  010-018, 
Zoonotic  Bacterial  Diseases  020-027  etc.).     Presence  of  a 
least  1  valid  diagnosis  within  the  range  given  in  this 
table  for  each  specific  condition  dummy  variable  results 
in  that  condition  being  coded  as  present  for  that 
beneficiary  during  1991.     The  full  set  of  variables 
defined  as  a  result  of  steps  1-5  are  used  in  the  next 
step . 

The  sixth  step  of  the  process  is  a  stepwise 
regression  with  all  predictors  defined  in  steps  1-5  plus 
the  four  sociodemographic  variables  included  in  the  AAPCC 
and  all  comparison  models.  Variable  selection  criteria 
for  the  regression  is  the  largest  increase  in  adjusted 
R2 .     The  best  35  condition  model  is  chosen,   apriori,   as  a 
stopping  point  because,   after  applying  final  selection 
criteria  (positive  effect  greater  than  $250  at  p  <  .001), 
it  would  yield  a  model  with  degrees  of  freedom  comparable 
to  the  ADG/MDC  comparison  model. 
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The  resulting  best  30-33  factor  models,   shown  in  the 
appendix,   are  examined  in  the  last  step  of  the  process. 
Here  the  following  criteria  for  including  variables  in 
the  final  model  are  applied;  positive  parameter  estimates 
over  $250,  policy  relevance,  clinical  cogency  and  an 
interest  in  retaining  any  predictors  that  are  proxies  for 
degree  of  severity,   frailty  and/or  limitations  in 
activities  of  daily  living  which,   as  Gruenberg  suggests 
(1996)  ,     may  enhance  predictive  accuracy  of  claims -based 
specific  condition  models.     This  review  results  in 
removal  of  one  condition  with  a  negative  parameter 
estimate,   lipoid  metabolic  disorders,   and  inclusion  of 
one  sociodemographic  variable.   Medicaid  buyin,  which  had 
not  yet  entered  the  modeling  process  but  should  be 
included  to  be  consistent  with  current  Medicare  policy  as 
reflected  by  the  AAPCC. 

The  specific  condition  model  shown  in  Table  4.2 
(SPEC  I)   lists  30  conditions  selected  for  the  final  model 
in  addition  to  the  AAPCC  variables.     Table  4.3  is  the 
specific  condition  model    (SPEC  II)   with  additional  dummy 
variables  for  number  of  inpatient  stays  during  1991. 
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Comparison  models,   summarized  in  Tables  4. 4 -4. 6  will 
now  be  described. 

The  AAPCC  proxy  model  has  four  socio- demographic 
variables .  Each  of  the  comparison  models  incorporates 
these  predictors.     Socio-demographic  variables  include 
one  continuous  variable   (years  of  age  over  65  ranging 
from  0  to  a  maximum  of  34  for  age  99  and  over)   and  3 
dummy  variables   (gender.  Medicaid  eligibility  and  ever 
having  qualified  for  Medicare  disability  coverage) .  The 
JHU  project  tested  a  variety  of  socio-demographic 
variables  including  age/gender  interactions.  These  four 
variables  are  either  the  best  predictors  or  most 
consistent  with  the  current  AAPCC  adjusters  and  HCFA 
policy.    (LEWIN  1996)  .     These  variables  comprise  the  AAPCC 
proxy  model  in  this  study  shown  in  Table  4.2. 

The  ACG/PACS  derivative  models  are  chosen  for 
comparison  with  specific  condition  models  for  several 
reasons.     First,  ACGs  and  PACS  are  well  known  risk 
adjustment  models.     Second,   they  are  viable  alternatives 
to  the  AAPCC  as  evidenced  by    their  consideration  for 
application  in  the  Choices  Demonstration  (HCFA  Choices 
RFP  1995)  .       As  such,    the  ADG/HOSDOM  and  ADG/MDC  models 
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serve  as  a  criterion,  if  not  gold,   standard  for  assessing 
the  viability  of  other  families  of  risk  models,   such  as 
the  specific  condition  models  developed  here.  Finally, 
physical  proximity  to  the  JHU  project  team  suggested 
communications  with  the  project  team  and  data  access 
might  be  more  easily  accomplished  than  with  researchers 
at  another  site. 

The  ADG/HOSDOM  model  is  one  of  two  chosen  from 
several  models  tested  in  the  JHU  project  (Lewin-VHI 
1996).     The  ADG/HOSDOM  variables,   listed  in  table  4.5, 
are  all  dummy  variables   (l=present ,  0=absent )  .     The  HOSDOM 
variable  indicates  a  beneficiary  had  at  least  1  diagnosis 
recorded  on  any  type  of  claim  that  has  a  high  (50%  or 
more)  probability  of  requiring  a  hospital  admission.  The 
remaining  dummy  variables  are  selected  ambulatory 
diagnostic  group  derived  from  all  claim  sources  using  the 
JHU  ACQ  ICD-9-CM  grouping  scheme  expanded  to  incorporate 
ICD-9-CM  codes  particularly  relevant  for  the  Medicare 
aged  population  (e.g.  neoplasm  diagnoses).     Criteria  for 
the  ADG  grouping  algorithm  are  discussed  in  the  previous 
chapter.     Additional  details  are  reported  in  the  final 
research  report  to  HCFA  by  the  JHU  project   (Lewin  1996, 
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Weiner  1996)   and  in  JHU' s  Physician's  Guide  to  ACGs. 
Extracts  from  that  guide  are  included  in  the  appendix. 

The  ADG/MDC  model  is  the  second  model  recommended  by 
the  JHU  project  and  is  also  discussed  in  detail  in  the 
previous  chapter.      ADG/MDC  independent  variables,  listed 
in  Table  4.6,   are  a  mix  of  dummy  and  continuous 
variables .     The  first  half  of  the  model  contains  dummy 
variables  representing  the  same  set  of  selected  ADG 
groups  used  in  the  ADG/HOSDOM  model,  but  are  derived 
only  from  ICD-9-CM  diagnosis  codes  recorded  on  face-to- 
face  physician  claims  only,  not  all  claims  as  in  the 
ADG/HOSDOM  model .     The  second  half  of  the  model  contains 
continuous  variables  reflecting  the  number  of  admissions 
in  1991  where  the  primary  diagnosis  for  the  stay  falls 
within  that  particular . Maj or  Diagnostic  Category  (MDC) . 
The  JHU  modeling  effort  selected  the  continues  MDC 
variables  after  testing  dummy  variables  for  each  of  these 
categories  and  several  other  indicators  for  numbers  of 
admissions  including  the  three  dummy  variables  included 
in  the  specific  condition  model  with  admissions.  (Lewin, 
1996;  Weiner,   1996) . 
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The  final  ADG/HOSDOM  and  ADG/MDC  models  shown  in 
Tables  4.5  and  4.6  represent  the  best  models  tested  in 
the  JHU  project   (Lewin  1996,  Weiner  1996)   and  are 
intended  for  use  by  health  plans  participating  in  the 
Medicare  "Choice"  Demonstration  (HCFA  Choices  RFP,   1995) . 

MODEL  EVALUATION  CRITERIA 

Four  factors  are  employed  in  comparing  the  specific 
conditions  models  with  the  AAPCC,  ADG/HOSDOM  and  ADG/MDC 
models;  predictive  accuracy,     administrative  feasibility, 
incentives  for  cost  effectiveness  and  difficulty  in 
gaming  the  system. 

Individual  predictive  accuracy  is  measured  by 
comparing  adjusted  R-square  across  models.       Adjusted  R- 
square  is  preferred  over  unadjusted  for  comparison  across 
models  because  it  adjusts  for  variation  in  degrees  of 
freedom  (Ash,   1989) .       However,  with  very  large  sample 
sizes  there  is  virtually  no  difference  in  the  adjusted 
and  unadjusted  R2 . 

Group  predictive  accuracy  is  measured  using  the 
predictive  ratio;  mean  predicted  1992  payment/  mean 
actual  1992  payment    (Ash,  1989;  Ellis,    1996;  Weiner, 
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1996)  .     This  measure  is  selected  over  others  presented  in 
the  methods  literature  review  because  it  uses  a  group 
rather  than  individual  unit  of  analysis,   is  the  most 
widely  used  measure  for  this  purpose  among  the  most 
relevant  recent  studies   (Weiner,  1996;  Ellis,   1996),  is 
simple  to  calculate  and  simple  to  interpret . 

Predictive  ratios  less  than  1  indicate 
underprediction .  Ratios  greater  than  1  indicate 
overprediction    The  further  the  predictive  ratio  varies 
from  1,  the  less  accurate  the  model.     No  standard  level 
indicating  acceptable  accuracy  for  predictive  ratios  has 
been  established  (Dunn,   1995;  Fowles,   1994) .  However, 
5%  variation  from  1.0    (0.95-1.05)    is  often  used  as  a  cut 
point  for  assessing  accuracy  and  is  adopted  here  for 
comparing  predictive  accuracy  across  models     (Dunn,  1995; 
Fowles,   1994;   Ellis,    1996;  Weiner,  1996). 

A  full  sample  predictive  ratio  for  a  model  is  of 
interest  because  it  indicates  how  accurately  the  model 
estimates  future  expenditures  in  the  aggregate.  However, 
since  a  primary  goal  for  risk  adjustment  models  is  to 
reduce  financial  and  social  incentives  for  risk 
segmentation  leading  to  biased  selection   (Luft,   1995)  ,  we 
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are  most  interested  in  determining  how  accurate  models 
are  across  groups  systematically  biased  with  respect  to 
prior  year  costs,  prior  year  hospitalizations  and 
conditions  diagnosed  in  the  prior  year.     The  more 
consistantly  accurate  the  model  the  lower  the  financial 
incentives  for  risk  segmentation. 

The  most  desirable  risk  adjustment  model  would  be 
the  most  accurate,  i.e.,  have  predictive  ratios 
consistently  within  5%  of  1.0  across  differently  biased 
samples.     If  a  risk  adjuster  is  consistently  accurate  in 
predicting  expenditures  across  randomly  and  non- randomly 
biased  samples,   then  financial  incentives  for  health 
plans  to  achieve  favorable  selection  are  minimized  as  are 
incentives  to  undertreat  or  exclude  the  most  severely  ill 
enrollees.     If  incentives  to  undertreat  are  minimized 
then  beneficiaries  have  more  incentive  to  consider 
enrollment  in  Medicare  capitated  health  plans .  In 
addition  to  comparing  aggregate  predictive  ratios  for  the 
full  sample  and  two  reinsurance  levels,  predictive  ratios 
are  also  compared  across  selected  ranges  of  for  1991 
expenditures,   selected  chronic  conditions  and  groups 
biased  with  respect  to  1991  hospital  utilization. 
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The  three  remaining  evaluation  criteria 
(administrative  feasibility,  incentives  for  cost 
effectiveness  and  ease  of  gaming/auditing  the  system)  are 
also  compared  across  the  five  models  by  a  synthesis  of 
advantages  and  disadvantages  of  each  type  of  model 
reported  in  the  literature  as  discussed  in  the  previous 
chapter . 

STATISTICAL  ANALYSES 

Exploratory  data  analyses  are  performed  to  obtain 
descriptive  statistics  for  each  sample.     Chi  Square  and  F 
tests  of  homogeneity  are  used  to  confirm  no  bias  is 
introduced  to  the  split  samples  by  eliminating  ineligible 
beneficiaries  after  random  allocation  to  the  split 
samples  had  been  performed. 

Ordinary  least  squares  multiple  regression, 
frequency  distributions,   and  stepwise  multiple  regression 
using  maximum  improvement  in  R2  as  the  selection  criteria 
are  employed  in  the  process  of  selecting  and  refining 
initial  specific  condition  independent  variables  for 
final  specific  condition  model  selection  as  described 
above . 
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Generalizability  of  the  resulting  risk  adjustment 
model  is  assessed  using  split  random  samples  to  replicate 
results  of  model  parameter  estimates  (estimated 
expenditure  per  condition) ,  and  individual  predictive 
accuracy  (adjusted  R-Square) .  Sample  1   (the  "development" 
sample,  N=618,418)   is  first  used  to  calculate  an  adjusted 
R-square  and  parameter  estimates.  This  process  is 
repeated  using  sample  2  as  the  calibration.  Results, 
reported  in  Table  5.5,   are  then  compared  for  consistency. 
Consistent  results  support  model  generalizability. 

Parameter  estimates  calibrated  from  Sample  1  (the 
"development"  sample  N=618,418)   are  applied  to  cases  from 
sample  2   (the  "validation"  sample,  N=618,772)   to  predict 
expenditures  and  calculate  predictive  ratios.  Predictive 
ratios  close   (within  10%)   to  1.0  also  indicate  good 
generalizability  for  the  model. 

DATA  PROCESSING  ENVIRONMENT 

All  analyses  are  performed  on  the  HCFA' s  IBM  3090 
research  mainframe  using  SAS  software . 
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TABLE  4.2  Independent  Variables:  Specific  Condition  (SPEC  I) 


MODEL  VARIABLE 
TYPE  NAME 

Basic  Male 

Yrsovr65 
Everdisa 
Buyin_91 

SPECIFIC  DMAL_SEC 
CONDITION  DX3_MAL 
DX3_SEC 
DX3_PSY 
SEP_BCCE 

DX3_250 

DX3_251 

PC_MALNU 

OTH_MET 

DX3_330 

D340-NOP 

PARALYSD 

DX3_350 

DX3_390 

OTH_NOHF 

CHF 

DISH_PCD 

DX3_430 

DX3_440 

DX3_451 

DX3_480 

DX3_490 

DX3_507 

DX3_510 

DX3_5XX 

DX3_580 

DX3_590 

DEC_ULCR 

DX3_7XX 

DX3  740 


DESCRIPTION 
Gender 

Years  over  age  65 

Ever  Medicare  disability  eligible 

Medicaid  eligible  in  1991 

Primary  and  Secondary  Malignancy 

Primary  Malignancy 

Secondary  Malignancy 

Psychiatric  Condition 

Septicemia  or  Bacterial  Infection  in 

Condition  Classified  Elsewhere 

Diabetes  Miletus 

Endocrine  Disorder  ex.  Diabetes 

Protein/Calorie  Malnutrition 

Other  Metabolic  ex  Lipoid 

Hereditary  or  Degenerative  CNS 

CNS  Without  Paralysis 

CNS  Paralysis 

PNS  Disorders 

Acute  Rheumatic  Fever 

Other  Heart  ex.  CHF 

Congestive  Heart  Failure 

Ischemic  Heart/Pul.  Cor.  Disease 

Cerebrovascular  Disease 

Arterial  Diseases 

Vein  and  Lymph  Diseases 

Pneumonia  and  Influenza 

COPD 

Ext.  Agt.  Lung  Disease-Solids/Liq. 
Other  Respiratory 
Digestive  Disorders 
Nephritis 

Other  Urinary  Diseases 
Decubitus  Ulcer 

Various  'Opathies  &  Rheumatism 
Congenital  Anomalies 


VALUE 

0=NO,1=YES 
0-34 

0=NO,1=YES 
0=NO,1=YES 


0=NO,L 
0=NO,1= 
0=NO,1= 
0=NO,1= 

0=NO,1= 
0=NO,1= 
0=NO,1= 
0=NO,1= 
0=NO,1= 
0=NO,L 
0=NO,1= 
0=NO,L 
0=NO,1= 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,L 
0=NO,1= 
0=NO,L 
0=NO,1= 
0=NO,L 
0=NO,L 
0=NO,L 


=YES 
=YES 
=YES 
=YES 

=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
^YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
=YES 
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TABLE  4.3  Independent  Variables:  Specific  Conditions  &  Inp.  Stays  (SPEC  II) 


MODEL 

TYPE 

Basic 


VARIABLE 

NAME 

Male 

Yrsovr65 

Everdisa 

Buyin_91 


DESCRIPTION 
Gender 

Years  over  age  65 

Ever  Medicare  disability  eligible 

Medicaid  eligible  in  1991 


VALUE 

0=NO,1=YES 

0-34 

0=NO,1=YES 


SPECIFIC      ADM910NE  One  1991  Inpatient  Stay 

CONDITION  ADM91TW0  Two  1991  Inpatient  Stays 

WITH  ADM9 1 3PL  Three  or  More  1 99 1  Inpatient  Stays 

INPATIENT  MAL_SEC  Primary  and  Secondary  Malignancy 

STAYS         DX3_MAL  Primary  Malignancy 

DX3_SEC  Secondary  Malignancy 

DX3_PSY  Psychiatric  Condition 

SEP_BCCE  Septicemia  or  Bacterial  Infection  in 

Condition  Classified  Elsewhere 

DX3_250  Diabetes  Miletus 

DX3_25 1  Endocrine  Disorder  ex.  Diabetes 

PC_MALNU  Protein/Calorie  Makiutrition 

OTH_MET  Other  Metabolic  ex  Lipoid 

DX3_330  Hereditary  or  Degenerative  CNS 

D340-NOP  CNS  Without  Paralysis 

PARALYSD  CNS  Paralysis 

DX3_350  PNS  Disorders 

DX3_390  Acute  Rheumatic  Fever 

OTH_NOHF  Other  Heart  ex.  CHF 

CHF  Congestive  Heart  Failure 

DISH  PCD  Ischemic  Heart/Pul.  Cor.  Disease 

DX3_430  Cerebrovascular  Disease 

DX3_440  Arterial  Diseases 

DX3_45 1  Vein  and  Lymph  Diseases 

DX3_480  Pneumonia  and  Influenza 

DX3_490  COPD 

DX3_507  Ext.  Agt.  Lung  Disease-Solids/Liq. 

DX3_5 1 0  Other  Respiratory 

DX3_5XX  Digestive  Disorders 

DX3_580  Nephritis 

DX3_590  Other  Urinary  Diseases 

DEC_ULCR  Decubitus  Ulcer 

DX3_7XX  Various  'Opathies  &  Rheumatism 

DX3_740  Congenital  Anomalies 


0=NO 

0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 

0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 
0=NO 


1=YES 

1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 

1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
1=YES 
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TABLE  4.4  Independent  variables:  AAPCC  Proxy  Model 


MODEL  VARIABLE 
TYPE  NAME 

Basic  Male 

Yrsovr65 
Everdisa 
Buyin_91 


DESCRIPTION 
Gender 

Years  over  age  65 

Ever  Medicare  disability  eligible 

Medicaid  eligible  in  1991 


VALUE 

0=NO,1=YES 
0-34 

0=NO,1=YES 
0=NO,1=YES 
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TABLE  4.5  Independent  Variables:  ADG/HOSDOM 
MODEL  VARL^LE 

TYPE  NAME  DESCRIPTION  VALUE 


Basic            Male  Gender  0=NO,1=YES 

Yrsovr65  Years  over  age  65  0-34 

Everdisa  Ever  Medicare  disability  eligible  0=NO,1=YES 

Buyin_91  Medicaid  eligible  in  1991  0=NO,1=YES 


ADG/HOSDOM 


HOSDOM 

Probable  Hospitalization  Diagnosis 

0=NO,L 

=YES 

ALADG03 

Time  LLmited,Major 

0=NO,1= 

=YES 

ALADG04 

Time  Ltd.  Major,  Pri.  Infections 

0=NO,L 

=YES 

ALADG06 

Asthma 

0=NO,L 

=YES 

ALADG07 

Likely  to  recur.  Discrete 

0=NO,L 

=YES 

ALADG09 

Likely  to  recur,  Progressive 

0=NO,L 

=YES 

ALADGll 

Chronic  Medical,  Unstable 

0=NO,L 

=YES 

ALADG16 

Chr.  Specialty,  Unstable,  Ortho. 

0=NO,L 

=YES 

ALADG22 

Injuries/Adverse  Effects,  Major 

0=NO,L 

=YES 

ALADG23 

Psychiatric,  Time  Limited,Minor 

0=NO,L 

=YES 

ALADG25 

Psychiatric 

Persistent  or  Recurrent,Unstable 

0=NO,L 

=YES 

ALADG27 

Signs/Symptoms,Muior 

0=NO,b 

=YES 

ALADG28 

Signs/Symptoms,Major 

0=NO,L 

=YES 

ALADG32 

Malignancy 

0=NO,L 

=YES 
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TABLE  4.6  Independent  variables:  ADG/MDC 


MODEL 
TYPE 

Basic 


VARIABLE 

NAME 

DESCRIPTION 

VALUE 

Male 

Gender 

0=NO,1= 

=YES 

Yrsovr65 

Years  over  age  65 

0-34 

Everdisa 

Ever  Medicare  disability  eligible 

0=NO,1= 

=YES 

Buyin_91 

Medicaid  eligible  in  1991 

0=NO,1= 

=YES 

****    AMBULATORY  VISIT  DIAGNOSIS  CLASS  ******** 

VLADG03 

Time  Limited,Major 

0=NO,1= 

=YES 

VLADG04 

Time  Ltd.  Major,  Pri.  Infections 

0=NO,1= 

=YES 

VLADG06 

Asthma 

0=NO,1= 

=YES 

VLADG07 

Likely  to  recur.  Discrete 

0=NO,1= 

=YES 

VLADG09 

Likely  to  recur.  Progressive 

0=NO,1= 

=YES 

VLADGl  1 

Chronic  Medical,  Unstable 

0=NO,1= 

=YES 

VLADG16 

Chr.  Specialty,  Unstable,  Ortho. 

0=NO,1= 

=YES 

VLADG22 

Injuries/Adverse  Effects,  Major 

0=NO,1= 

=YES 

VLADG23 

Psychiatric,  Time  Limited,Minor 

0=NO,1= 

=YES 

VLADG25 

Psychiatric 

Persistent  or  Recurrent,Unstable 

0=NO,1= 

=YES 

VLADG27 

Signs/Symptoms,Minor 

0=NO,1= 

=YES 

VLADG28 

Signs/Symptoms,Major 

0=NO,1= 

=YES 

VLADG32 

Malignancy 

0=NO,1= 

=YES 

NUMBER  OF  E^PATIENT  STAYS  PER  MDC  ********** 


MDCOl  Nervous  System  0-X 

MDC03/04  E/N/T/Respiratory  System  0-X 

MDC05  Circulatory  System  0-X 

MDC06  .     Digestive  System  0-X 

MDC07  Hepatobiliary  System/Pancreas  0-X 

MDC08  Musculoskeletal/Connective  Tissue  0-X 

MDC09  Skin,  Subcutaneous  Tissue  &  Breast  0-X 

MDC  10  Endocrine/Nutrition/Metabolic  DX  0-X 

MDC  11  Kidney/Urinary  Tract  0-X 

MDC  18  Infectious/Parasitic  Diseases  0-X 

MDC  1 9/20  Mental  Disease/Alcohol/Drug  Abuse  0-X 

MDC21  Injuries/Poisenings/Bums  0-X 

MDC23/24  Health  Status  Factors/Trauma  0-X 

MDC25/16/17  Blood,  Immunological, 

Myeloproliferative/HIV/AIDS  0-X 

MDC26  Transplants  0-X 
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CHAPTER  V:  FINDINGS 
OVERVIEW 

This  chapter  presents  findings .     Results  comparing 
two  specific  condition  models  with  an  AA.PCC  proxy  and  two 
criterion  models,   the  ADG/HOSDOM  and  ADG/MDC,  are 
presented  for  all  four  evaluation  criteria:   1) predictive 
"accuracy,  2) administrative  feasibility,  3) incentives  for 
cost  effectiveness  and  4) ease  of  manipulating  and  auditing 
the  system. 

Discussion  of  results  begins  with  a  description  of 
sample  characteristics .     Findings  for  test  of  homogeneity 
between  the  split  samples  follow.  A  comparison  of  the  five 
models  with  respect  to  predictive  accuracy  (individual  R2 
and  group  predictive  ratios) ,  administrative  feasibility, 
incentives  for  cost  effectiveness  and  ease  of  gaming 
concludes  this  chapter. 

SAMPLE  CHARACTERISTICS 

Table  5.1  serves  two  purposes:   1)  reporting  the 
proportion  of  beneficiaries  with  each  dummy  variable 
discussed  here  and  2)   reporting  Chi  Square  tests  of 
homogeneity  discussed  in  the  next  section. 
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Proportions  reported  for  each  independent  variable 
indicate  the  prevalence  of  that  characteristic  in  the 
population  in  1991.     For  ease  in  describing  development 
sample  characteristics  only  proportions  from  that  sample 
are  discussed  in  the  text.     As  the  next  section  on  sample 
homogeneity  indicates,  discussing  only  one  sample  is  not 
a  biased  description  of  the  sample  since  there  are  no 
statistically  significant  differences  between  sample 
characteristics . 

Development  sample  sociodemographic  characteristics 
are  .394  male,    .062  ever  receiving  Medicare  disability, 
and  .092  eligible  for  Medicaid.     The  proportion  with  one 
or  more  hospitalizations  is   . 176 .     Proportions  of 
beneficiaries  with  selected  specific  conditions  are; 
primaary  neoplasms   (except  non-melanoma  skin  cancer)  .126, 
any  psychiatric  condition  .048,  diabetes  mellitus  without 
ESRD  .135,   congestive  heart  failure   .084,  cerebrovascular 
disease   .082,  pneumonia  and  influenza  .061  and  chronic 
obstructive  pulmonary  disease  .141. 

Table  5.2  contains  means  for  continuous  dependent 
and  independent  variables  by  sample  and  F  tests  of  sample 
homogeneity. 
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Development  sample  mean  age  is  74.5   (65  +  mean  of 
9.5  years  over  65).     Mean  adjusted  annualized  1992 
payment  is  $3,43  0. 

Data  from  another  Medicare  source,  the  Medicare 
Current  Beneficiary  Survey  (MCBS) ,   confirms  sample 
prevalence  of  selected  conditions  among  those  age  65  and 
over.  Even  though  the    MCBS  population  is  a  subset  of  the 
5%  sample  used  in  this  study,   the  sample  is  different  in 
that  it  includes  persons  with  ESRD  and  disabilities  who 
are  under  age  65.     MCBS  condition  prevalence  is  based  on 
self -reported  conditions,  not  claims  as  in  this  research, 
and    may  capture  unmet  demand  not  reflected  in  claims 
data.     Given  these  differences  in  sample  composition  and 
measurement,   some  variation  in  prevalence  is  to  be 
expected.     However,  where  comparisons  were  possible 
across  data  sources,  prevalence  is  fairly  comparable. 

Table  5.3  contains  1991  prevalence  estimates  for 
diabetes,   COPD,   hypertension  and  cerebrovascular  disease 
from  this  study  and  the  MCBS     (MCBS  Manual,   1991)  .  For 
example,   the  1991  MCBS  estimate  of  the  proportion  of 
enrollees  with  diabetes  is  14.8%  compared  with  13.5%  for 
the  Medicare  aged  in  this  study.     MCBS  COPD  prevalence  is 
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13.1%  compared  to  14.1%  in  this  study.  Prevalence 
estimates  for  cerebrovascular  disease  are  10.7%  in  the 
MCBS  and  8.2%  here.     The  greatest  divergence  in 
prevalence  between  the  two  sources  is  for  hypertension 
46.9%  for  MCBS  and     37.8%  here.     Again  this  could  be  due 
to  inclusion  of  ESRD  and  disabled  persons  in  the  MCBS 
and/or  the  difference  in  measurement  techniques- 


TABLE  5.3 

1991  Prevalence  Estimates 
for  Selected  Conditions 

Diabetes  COPD 

Hyper- 
tension 

Cerebro- 
vasular 

MCBS 

14.8%  13.1% 

46.9% 

10.7% 

THIS 
STUDY 

13.5%  14.1% 

37.8% 

8.2% 

Despite  a  few  differences  in  sample  selection  and 
data  collection  methods,   reasonable  consistency  of 
disease  prevalence  estimates  across  these  sources 
increases  confidence  in  validity  of  the  specific 
condition  variables  derived  in  this  research. 

Parameter  estimation  "benchmarks"  reported  in  the 
appendices  support  reliability  of  the  measures  for 
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comparison  models-     Benchmark  tables  compare  parameter 
estimates  derived  in  this  project  with  those  reported  in 
the  JHU  project  that  derived  the  ADG/HOSDOM  and  ADG/MDC 
models . 

SAMPLE  HOMOGENEITY 

Since  eligibility  restrictions  are  applied  after 
random  assignment  of  enrollees  to  split  samples,   tests  of 
sample  homogeneity  are  conducted  to  determine  if  any  bias 
is  introduced  by  this  process  or  by  chance . 

Chi  square  tests   (Table  5.1)   of  sample  homogeneity 
(P<=.001)   indicate  virtually  no  bias  is  introduced  to  the 
study  design  by  excluding  non-eligible  cases  from  the 
split  samples  after  they  had  been  drawn  at  random  from 
the  full  5%  sample.     Among  all  categorical  independent 
variables  incorporated  in  the  five  comparison  models, 
only  variables  related  to  septicemia  are  statistically 
significant  at  p  =   .015  with  a  slightly  higher  prevalence 
in  the  development  sample  that  is  not  evident  in  the 
table  with  proportions  rounded  to  1  decimal  place. 
Because  Chi  square  is  sensitive  to  sample  size  and  tends 
to  overestimate  statistical  significance  for  large 
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samples,  the  lack  of  any  statistically  significant  Chi 
Square  tests  among  dummy  variables  included  in  the  five 
models  for  two  very  large  samples  is  strong  evidence  of 
sample  homogeneity. 

Heterogeneity  of  split  samples  is  further  assessed 
using  one-way  analysis  of  variance  F  tests  performed  on 
all  continuous  independent  and  dependent  variables 
incorporated  in  the  models.     There  are  no  statistically 
significant  differences  between  means  below  p=.10.  Table 
5.2  shows  means,   F  test  for  difference  between  means  and 
P  value  by  variable  and  sample. 

CONSISTENCY  OF  ESTIMATES 

The  final  specific  condition  model  is  calibrated 
using  both  samples  to  assess  consistency  of  estimates 
across  two  samples.  Resulting  model  statistics  and 
parameter  estimates  are  shown  in  Table  5.5. 

SPEC  I  adjusted  R- Square  for  development  and 
validation  samples  are,   respectively,   0.0725  and  0.0691. 
Comparable  figures  for  SPEC  II  are  0.0750  (development 
R2)   and  0.0717   (validation  R2)  .     All  parameter  estimates 
remained  statistically  significant  at  the  p<=0.001  level. 
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The  direction  and  magnitude  of  parameter  estimates 
are  also  comparable  across  samples.     For  example, 
estimates  for  the  socio- demographic  factors  are, 
respectively  $606  and  $635  for  male  gender,  $41  and  $40 
for  each  year  over  age  65,   $926  and  $873  for  ever  having 
been  disabled  and  $321  and  $328  for  Medicaid  eligibility 
in  1991.     Estimates  for  selected  specific  conditions  are, 
respectively,   $3,598  and  $3,576  for  having  a  primary  and 
a  secondary  malignant  neoplasm,   $1,212  and  $1,099  for 
diabetes,     $3,116  and  $2,403  for  septicemia  only,  $937 
and  $971  for  bacterial  infections  related  to  conditions 
classified  elsewhere,    $2,058  and  $2,168  for  congestive 
heart  failure  and  $937  and  $1,024  for  chronic  obstructive 
pulmonary  disease.     Wider  variation  for  septicemia  may  be 
due  to  slight  bias  in  prevalence  between  the  two  samples 
discussed  above. 

PREDICTIVE  ACCURACY 

Table  5.6  summarizes  individual  predictive  accuracy 
and  model  characteristics  across  all  5  models  for  the 
full   (untruncated)   model  and  truncation  at  two 
reinsurance  levels;   $100,000  and  $50,000. 


163 


id 
© 


0) 
T3 
O 


c 
o 
w 

(5 

Q. 

E 
o 
O 

$ 
O 

> 
o 

T3 
(U 


0) 
.> 

o 

(U 
Q. 
(/> 
O 


c 

E  ■ 


CD 


•D 

C  CO 

< 

CM  2 

C35  CD 

°?  to 

^  0) 

.<2  S 

<u  .55 

-S 

>  (D 

cE 

■O  Q. 

C  0 

0  -S 

Q.  c 


'CDlCM 
ICMICO 
|CD1 

iOl 


CO 


CD 

CO 

|io 

CO 

268. 

io 

C3> 

CO 

CM 


!CM  CO 

io! 


■  CO  C3> 

!cm!«» 


ICMI 


1  o  E 

iU.1  i 


CD 


CD 


CD 


CD 


<0] 


CO 


O 

CJ>  CO 

CO] 

Pi 

O! 

i 


CM 


CO 


CM 


CM  r-" 


o 

ID 

CM 

cn 

CO 

co" 

co" 

CM  co' 


CM 


o 

a> 

CM 

CM 

CO 

co" 

Q. 
0) 


CO 


Specific  condition  models  rank  highest  on  individual 
predictive  accuracy  for  the  full  model  and  models 
truncated  at  $100k  and  $50k.       R2  increases  with  lower 
truncation  levels,  however,  rankings  across  models  remain 
the  same . 

SPEC  I,   the  3  0  condition  model  without  1991 
admission  dummy  variables,  has  full  model  individual 
predictive  accuracy  (adjusted  R-square)   of  0.0725  and 
0.0750  for  SPEC  II    (30  condition  model  with  those  three 
variables)  .     These  figures  exceed  those  attained  by  the 
AAPCC  model    (0.0101),  ADG/HOSDOM  model   (0.0554)  and 
ADG/MDC  model  (0.0626). 

Validation  sample  group  predictive  ratios  are  shown 
in  table  5.7  across  all  5  models. 


166 


Specific  condition  models  rank  highest  on  individual 
predictive  accuracy  for  the  full  model  and  models 
triincated  at  $100k  and  $50k.       R2  increases  with  lower 
truncation  levels,  however,  rankings  across  models  remain 
the  same . 

SPEC  I,   the  30  condition  model  without  1991 
admission  dummy  variables,  has  full  model  individual 
predictive  accuracy  (adjusted  R-square)   of  0.0725  and 
0.0750  for  SPEC  II    (30  condition  model  with  those  three 
variables)  .     These  figures  exceed  those  attained  by  the 
AAPCC  model    (0.0101),  ADG/HOSDOM  model   (0.0554)  and 
ADG/MDC  model    (0.0626) . 

Validation  sample  group  predictive  ratios  are  shown 
in  table  5.7  across  all  5  models. 
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Aggregate  ratios  are  calculated  for  the  full  1992 
Payment  and  both  reinsurance  levels.     Predictive  ratios 
are  also  calculated  for  several  systematically  biased 
samples;   samples  of  percentile  groups  within  1991  total 
payment  percentile  ranges   (top  2  5%,   10%,   5%  and  1%  of 
payments  and  their  complement  the  bottom  75%,   90%,  95% 
and  99%  of  payments) ,  samples  biased  with  respect  to 
number  of  1991  hospital  admissions   (None,  any,   one,  two 
and  three  plus)   and  samples  of  persons  with  selected 
conditions    (Diabetes,  Cerebrovascular  disease,   any  Heart 
Condition  and  Malignant  Cancers  excluding  non-melanoma 
skin  cancer) . 

These  biased  samples  are  chosen  because  they  reflect 
three  types  of  factors  that  might  be  used  to  risk  segment 
the  market;  prior  costs,  prior  use  and  specific  chronic 
conditions .     These  are  also  a  subset  of  systematically 
biased  samples  employed  in  other  Medicare  risk  modeling 
research   (Ellis  1996,  Weiner  1996)  . 

All  models  have  fairly  comparable  and  accurate 
predictive  ratios   (1.007-1.026)   for  full  sample  total 
payments  and  for  both  reinsurance  levels   (1.0-1.025)  as 
shown  in  Table  5.7. 
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The  AAPCC  proxy  model  performs  more  poorly  for  all 
biased  samples  than  any  of  the  other  models.     Most  AAPCC 
predictive  ratios    fell  well  outside  the   .950-1.05  range. 
AAPCC  predictive  ratios  for  all  non-randomly  biased 
groups  range  from  0.2  71  to  1.394.     The  AAPCC  overpredicts 
for  low  cost  enrollees  and  underpredicts  for  high  cost 
enrollees.     This  finding  is  consistent  with  other 
research  suggesting  the  inadequacy  of  the  AAPCC  for 
minimizing  biased  selection  in  capitated  health  plans 
(Brown,    1993;  Ellis,   1996;  GAO,   1994;   Gruenberg,  1996; 
Weiner,   1996) . 

Specific  condition  models  are  more  accurate 
predictors  for  systematically  biased  samples  than  the 
ADG/HOSDOM  and  ADG/MDC  models. 

SPEC  I  and  the  ADG/HOSDOM  model  have  comparable 
performance  in  samples  biased  with  respect  to  hospital 
utilization  and  tend  to  under-predict  while    SPEC  II  and 
the  ADG/MDC  models  are  more  likely  to  over-predict  for 
samples  with  hospital  admissions.     For  a  biased  sample  of 
enrollees  with  one  admission  and  three  plus  admissions, 
respectively,  both  the  Specific  Condition  ratios  (.912, 
1.114)    and  ADG/HOSDOM   (.894,    1.308)    ratios  fell  outside 
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the  5%  interval  with  the  ADG/HOSDOM  model  more  likely  to 
over-predict  for  1  admission  and  under-predict  for  three 
plus  admissions - 

For  samples  biased  with  respect  to  level  of  1991 
expenditures,   all  models  tend  to  overpredict  lower  cost 
users   (range  1.024-1.099)   and  underpredict  high  cost 
users   (.645-. 968).     The  specific  condition  model  with 
hospital  admission  dummy  variables  is  the  most  precise 
model  with  predictive  ratios  for  the  top  end  of  the  1991 
cost  distribution  from  .954-.  968  and  the  bottom  end  of 
the  distribution  from  1.026-1.063.  The  ADG/HOSDOM  is 
least  precise  with  ranges  from  .645-. 909  for  low  cost 
users  and  1.030-1.099  for  high  costs  users. 

The  same  pattern  emerges  when  predictive  accuracy  of 
the  four  models  for  hospital  utilization  is  compared. 
The  specific  condition  models  with  admission  dummy 
variables  is  most  accurate   (1.009-1.035)   and  the 
ADG/HOSDOM  least  accurate   (.670-1.070)   for  groups  with  no 
hospitalization  and  up  to  3+  hospitalizations. 

Among  samples  biased  with  respect  to  specific 
conditions,   the  ADG  derivative  models  tend  to  under- 
predict  (range   .839-. 980)   and  are  comparable  with  one 
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another.     Specific  condition  models  tend  to  overpredict 
(range  1.029-1.46)   and  are  comparable  with  one  another. 
The  Specific  Condition  models  are  roughly  10%  more 
accurate  for  a  Diabetic  cohort   (1.045  and  1.046)  compared 
to  the  ADG  models   (0.839  0.850).     The  level  of  accuracy 
for  all  four  models  is  comparable  and  within  the  .95-1.05 
range  the  other  disease  groups;     Cerebrovascular,  Heart 
and  Cancer  conditions. 

Each  model  is  now  ranked  based  on  consistency  of 
predictive  accuracy  across  all  non- randomly  biased  groups 
based  on  1991  cost  distribution,  number  of  1991 
hospitalizations,   and  four  conditions  diagnosed  in  1991. 
The  rankings  are  based  on  the  range  of  predictive  ratios 
for  each  model  across  all  biased  groups.     Using  this 
method,  model  rankings  and  range  of  predictive  ratios 
from  highest  accuracy  to  lowest  are: 

1)  SPEC  II    (  .954-1.063)  , 

2)  ADG/MDC  (.850-1.070), 

3)  SPEC  I    (  .804-1.079)  , 

4)  ADG/HOSDOM    (.670-1.099)  and 

5)  AAPCC    ( .296-1 .394) . 
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These  findings  suggest  that  financial  incentives  for 
biased  selection  are  indeed  reduced  by  specific  condition 
models.     Based  on  predictive  accuracy  alone,  specific 
condition  models  are  clearly  viable  alternatives  to  the 
AAPCC  for  the  Medicare  aged.     However,  predictive 
accuracy  alone  is  only  one  of  four  criteria  for  adopting 
a  risk  assessment  method. 

Administrative  Feasibility 

Administrative  feasibility  for  all  models  tested 
is  comparable  with  respect  to  cost,   administrative  burden 
and  applicability  across  settings  given  availability  of 
claims  data.     All  models  compared  use  administrative  data 
currently  collected  for  plan  and  management  and  claims 
payment  within  the  f ee- f or-service  sector. 

Each  of  these  claims-based  models  is  not,  however, 
equally  feasible  for  capitated  managed  care  plans  that  do 
not  collect  encounter  level  claims  data  or  have 
computerized  medical  record  systems  that  record  diagnoses 
using  ICD-9-CM  codes.     A  recently  published  study 
recommends  models  based  on  a  list  of  self -reported 
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conditions  are  preferred  absent  availability  of  - 
diagnosis-based  models  from  claims  data  (Fowles,   1996)  . 

COST  EFFECTIVENESS  INCENTIVES 

The  third  criteria  for  evaluating  risk  models  is 
incentives  for  cost  effectiveness.     Cost  effectiveness  is 
a  function  of  both  cost  and  outcomes .     Both  predictive 
accuracy  and  ease  of  auditing  a  system  impact  this 
criteria.     Models,  with  high  predictive  accuracy  that  are 
easy  to  audit  create  pressure  for  competition  among  plans 
on  price.     Where  quality  assessment  mechanisms  are  also 
in  place,   specific  condition  models  provide  incentives 
for  cost  effectiveness  that  are  at  least  comparable  to 
more  complex  clinically-based  models  like  the  ADG/HOSDOM 
and  ADG/MDC  models. 

GAMING /AUDITING 

The  fourth  criteria  for  evaluating  risk  adjustment 
models  is  gameability.     No  model  is  immune  from  gaming  or 
fraud   (Anderson  GF,   1995)  .     Because  the  ADG  derivative 
models  are  more  complex  with  respect  to  assigning 
diagnosis  codes  to  risk  classes,  these  models  would  be 
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harder  for  providers  to  manipulate.     However,   for  the 
same  reason,   they  would  also  be  harder  to  audit. 

Because  specific  condition  models  are  simple 
groupings  of  ICD-9-CM  codes,   ease  of  gameability  has  been 
cited  as  a  problem  with  specific  condition  models  (GAO, 
1994)  .     Conversely,   because  of  their  simplicity,  specific 
condition  models  are  also  easier  to  audit. 

FACE  VALIDITY 

Face  or  clinical  validity  of  the  model  is  supported 
by  two  factors.     The  use  of  the  ICD-9-CM  diagnostic 
coding  standard,  which  has  an  inherent  degree  of  clinical 
validity,   lends  face  validity  to  specific  condition 
variables  in  the  model.     Physician  reviewers,  Dr.  Neil 
Powe  and  Dr.  Albert  Wu  confirmed  clinical  validity  of 
specific  condition  indicators  is  at  least  a  good  as  the 
comparison  models  and  reasonable  given  knowledge  of 
average  costs  for  selected  conditions  based  on  the 
literature  and  experience  in  developing  other  risk 
adjustments  models. 
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SUMMARY 

In  summary,   five  risk  adjustment  models  are  compared 
using  four  evaluation  criteria;  predictive  power, 
predictive  accuracy,   administrative  feasibility  and 
gameability.     Two  specific  condition  models  are     at  least 
comparable  to  ADG/HOSDOM  and  ADG/MDC    models  derived 
specifically  for  the  Medicare  aged  population  and 
outperform  the  AAPCC.     Comparability  with  the  ACG/PACS 
derivatives  suggests  that  claims-based  specific  condition 
models,   like  their  self -report  analogs   (Gruenberg,  1996; 
Fowles,    1994  1996)   are  indeed  viable  alternatives  to  the 
AAPCC. 
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CHAPTER  VI:  DISCUSSION 


Implications 

The  findings  presented  in  the  previous  chapter  have 
several  implications  discussed  in  more  detail  below. 
These  implications  can  be  summarized  as  follows: 

1)  Claims-based  specific  condition  models  are  viable 
alternatives  to  Medicare's  AAPCC  for  the  Medicare 
aged . 

2)  Specific  condition  models  could  be  customized  for 
application  to  other  populations   (e.g.  Medicare 
ESRD,  Medicare  Disabled,   Medicaid,  Employees) . 

3)  Specific  condition  models  could  be  applicable  for 
risk  adjusting  process  and  outcomes  assessment  (e.g. 
guideline  compliance  and  comparisons  across 
providers) . 

4)  Generalized  software  and  a  guide  for  implementing 
the  seven  step  specific  condition  modeling 
methodology  employed  here  could  be  produced  to 
facilitate  development  of  specific  condition  risk 
assessment  models  customized  for  other  populations 
and  for  outcomes  risk  adjustment. 


Claims-based  specific  condition  models  are  viable 
alternatives  to  Medicare's  AAPCC  because  they  outperform 
Medicare's  current  AAPCC  method  and  are  at  least 
comparable  to  two  accepted  alternatives,  the  ADG/HOSDOM 
and  ADG/MDC  models,  on  four  evaluation  criteria; 
predictive  accuracy,   incentives  for  cost  effective 
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healthcare,  ability  to  audit  for  fraud  or  gaming  -and 
administrative  feasibility. 

The  first  evaluation  criteria,  predictive  accuracy, • 
has  two  components;  individual  and  group  accuracy.  Risk 
assessment  researchers  differ  over  whether  individual  or 
group  predictive  accuracy  is  more  important  when 
evaluating  risk  models    (Dunn,   1995) .     The  relative 
importance  of  the  measures  varies  depending  on  the 
objectives  for  the  risk  measure   (Dunn,   1995) . 

Predictive  accuracy  at  the  individual  level  is  of 
interest  when  the  primary  objective  is  to  minimize 
incentives  for  selection  within  risk  groups  at  enrollment 
or  through  disenrollment    (Newhouse,    198  9) .     This  issue  is 
particularly  important  for  the  most  vulnerable  high  cost 
individuals  and  those  with  chronic  conditions  because  it 
effects  the  accessibility  of  health  care  for  those  with 
the  greatest  need  (Anderson,   1990;  Kronick,  1995). 
However,   as  Kronick  points  out,   risk  classes  formed  using 
specific  chronic  or  high  cost  conditions  decrease  the 
chance  of  discrimination  against  individuals  with  those 
conditions  and  provide  incentives  for  health  plans  to 
enroll  such  members  because  plans  receive  increased 
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payments  for  patients  with  those  conditions  (Kronick, 
1996) . 

If,   as  here,  the  primary  objective  is  to  set  correct 
payment  rates  for  groups    of  enrol lees  that  minimize 
incentives  for  selection  then  group  predictive  accuracy 
is  of  highest  priority  (Hornbrook,    1995;   Luft,    1995) . 
"...  [C]  oncern  about  the  poor  explanatory  power  of  models 
focusing  on  individual  expenditures  is  misplaced  if  the 
real  question  is  how  well  one  can  predict  for  groups, 
particularly  groups  reflecting  nonrandom  risk- related 
selection."   (Luft,  1995). 


Table  6.1  summarizes  group  and  individual  predictive 
accuracy  findings. 


Table     6 . 1 

Summary  of 

Predictive  Accuracy 

Models 

Pr'ed.  Ratio 

Ranked 

Range  of 

Adj .  R2 

By  Group 

Group 

Group 

Individual 

Ind 

Accuracy 

Rank 

Accuracy 

Accuracy 

Rank 

SPEC  II 

1 

. 954-1 . 063 

0 . 0750 

1 

ADG/MDC 

2 

.850-1.070 

0.0626 

3 

SPEC  I 

3 

.  804-1. 079 

0  .  0725 

2 

ADG/HOSDOM 

4 

.670-1. 099 

0 . 0554 

4 

AAPCC  PROXY 

5 

.296-1.394 

0 . 0101 

5 
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Model  ranking  is  based  on  consistency  of  group 
predictive  accuracy  for  biased  groups .     Groups  are 
biased  based  on  enrollee  health  status  characteristics 
that  could  be  used  to  risk  segment  the  market  such  as 
1991  expenditure  level,    1991  hospital  use  and  specific 
conditions  present  in  1991   (see  Table  5.7  page  169  for  a 
detailed  listing  of  100  predictive  ratios,  20  groups  for 
each  of  5  models,   summarized  in  Table  6.1) . 

Model  ranks  based  on  individual  accuracy  (Adjusted 
R2)   are  also  reported  in  Table  6.1.     Group  and  individual 
predictive  accuracy  ranks  are  equivalent  for  3  of  5 
models.     SPEC  II  ranks  1st,  ADG/HOSDOM  4th  and  AAPCC 
proxy  5th  for  both  measures.     The  ADG/MDC  model  ranks  2nd 
for  group  accuracy  and  3rd  for  individual  accuracy.  The 
SPEC  I  model  ranks  3rd  in  group  and  2nd  in  individual 
accuracy. 

From  these  rankings  it  is  clear  that  the  specific 
condition  models  exceed  predictive  accuracy  of  the  three 
comparison  models,  and,   therefore,   could  reduce 
incentives  for  health  plans  to  risk  segment  the  Medicare 
aged  market.     However,  predictive  accuracy  is  only  one  of 
four  evaluation  criteria.     We  must  also  consider 
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incentives  for  cost  effective  care,   ease  of 
auditing/gaming  the  system  and  administrative 
feasibility. 

Incentives  for  cost  effective  care  are  a  function  of 
two  factors;  1)   the  type  of  predictors  in  the  model  and 
2)   degree  of  predictive  accuracy. 

Type  of  predictors  in  a  risk  assessment  model  are  limited 
by  how  those  predictors  are  perceived  with  respect  to  their 
influence  on  incentives  for  cost  effective  care.     This  policy 
consideration  may  result  in  elimination  of  certain  types  of 
predictors  in  two  different  stages  of  the  risk  assessment 
modeling  process.     First,   the  conceptual  framework  (Figure 
III -3  on  page  60)     for  applied  risk  adjustment  of  capitation 
payments  to  health  plans  omits  domains  of  factors  with  obvious 
disincentives  for  achieving  objectives  of  a  risk  model. 

Plan,  provider  and    most  community  characteristics  were 
omitted  from  the  conceptual  framework.     Adjusting  for 
differences  in  plan  and  provider  characteristics  would  reward 
inefficiencies  in  plan  or  provider  practices.     For  example,  if 
a  plan  that  provides  health  care  to  a  broad  spectrum  of  the 
population  has  a  much  greater  proportion  of  high  cost 
specialists  than  a  typical  plan  and  "proportion  of  specialists 
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in  the  plan"  is  a  risk  factor  included  in  the  model,  that  plan 
would  be  rewarded  for  overstaf fing  with  specialists  and 
charging  higher  rates  to  cover  their  costs  by  having  their 
estimated  mean  cost  adjusted  upward.     In  contrast,  plans  with  a 
more  cost  effective  staffing  mix  would  be  penalized. 

Community  indicators  other  than  geographic  location  (e.g. 
health  resource  availability  such  as  hospital  beds/1000)  are 
not  typically  in  the  conceptual  framework  because  either  data 
is  not  readily  available  or,   like  plan  and  provider 
characteristics,  those  predictors  would  provide  inappropriate 
incentives  if  included  in  the  model.     For  example,  including 
hospital  beds  per  1000  population  could  be  contrary  to  the 
incentives  for  cost  effectiveness  by  rewarding  inflated  rates 
due  to  excess  hospital  capacity  or  inefficiencies  due  to 
supplier  induced  demand. 

Prior  use  measures  fall  within  the  health  status 
domain  and  are  not  generally  excluded  from  the  conceptual 
framework.     Prior  use  measures,  which  are  criticized  for 
providing  incentives  for  increasing  use   (Anderson,  1990; 
GAO,   1994),  are  included  in  two  of  the  comparison  models. 

The  ADG/MDC  model  contains  14  variables  that  count 
1991  admissions  based  on  primary  discharge  diagnosis 
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within  each  of  14  diagnosis  groupings .     Because  of  the 
potential  incentives  for  these  variables  to  increase 
hospital  use,   the  less  accurate  ADG/HOSDOM    model  was 
recommended  as  the  best  model  by  the  developers  because 
it  does  not  include  any  direct  prior  use  measures 
(Anderson,   1997) . 

SPEC  II,  the  top  ranking  model,  has  3  mutually 
exclusive  dummy  variables  indicating  level  of  1991 
hospital  use   (e.g.  1  stay,  2  stays  or  3+  stays) . 
Presence  of  these  variables  renders  this  model  subject  to 
the  same  type  of  criticism  leveled  at  models  that  include 
prior  use  variables . 

If  models  with  prior  use  variables  providing 
disincentives  for  cost  effective  care  are  eliminated 
during  the  evaluation  stage  of  the  modeling  process, 
accuracy  of  payments  remains  a  key  factor  in  promoting 
cost-effective  or  appropriate  care   (i.e.  avoiding 
unnecessary  treatments  and  providing  necessary  care 
especially  to  the  most  seriously  ill) . 

Discussion  of  two  evaluation  criteria  remain  for 
determining  whether  specific  condition  models  are  viable 
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alternatives  to  the  AAPCC;  concerns  over  gaming  the 
system  and  administrative  feasibility. 

Absent  fraud  or  malpractice,   concern  over  ease  of 
gaming  and  auditing  claims -based  risk  models  pertains  to 
systematic  bias  in  reporting  diagnoses  commonly  refereed 
to  as  "upcoding"   i.e.  recording  higher  cost  diagnoses 
than  the  true  diagnosis  for  a  patient   (Weiner,   1996) . 
Annual  recalibration  of  risk  adjustment  models  limits  the 
effect  of  gaming  by  upcoding  of  diagnoses   (Ellis,  1996). 

Upcoding  concerns  are  equally  relevant  to  all  five 
models.     However,   the  SPEC  I  and  II  models  include 
diagnoses  from  laboratory  procedure  claims  to  indicate 
presence  of  a  disease.     Using  diagnoses  from  lab  claims 
could  encourage  increased  lab  testing  for  rule  out 
conditions  causing  artificially  inflated  payments  to 
plans.     This  gaming  concern  for  the  SPEC  I  and  SPEC  II 
models  would  be  eliminated  by  omitting  lab  diagnoses  when 
defining  presence  of  conditions  for  specific  condition 
models . 

The  relative  ease  of  gaming  concerns  for  specific 
condition  models  may  be  offset  by  the  availability  of 
existing  and  emerging  technologies  for  monitoring  the 
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quality  of  health  care.  Recent  Medicare  quality  - 
assessment  research,   demonstrations  and  implementation 
projects  exemplify  quality  assurance  technologies  and 
processes  that  could  also  function  as  audit  mechanisms  to 
counterbalance  the  ease  of  gaming  specific  condition  risk 
adjustment  models;  e.g.  DEMPAC  (Lawthers  1993;  Parente, 
1995;  Weiner,   1995) ,  HEDIS  report  card  and  accreditation 
standards  and  HCFA' s  Health  Standards  Quality  Improvement 
Program  (HSQIP)   initiatives.  These  quality  review  methods 
are  designed,   in  part,  for  application  to  "tracer 
conditions"    (Ressner  DM,   1973) . 

If  the  conditions  in  the  model  were  also  tracer 
conditions  for  measuring  adherence  to  practice  guidelines 
and  providing  feedback  to  providers  regarding  quality  of 
care  and  practice  pattern  variations,   then  specific 
condition  models  could  be  less  burdensome  to  audit  than 
more  complex/indirect  measures   (e.g.,  DCGs,  ACGs  and 
ADGs)  because  quality  assurance  processes  could  double  as 
a  basis  for  auditing  risk  model  gaming  practices.  This 
implication  is  inferred  from  various  research  (Berwick, 
1991;  Eddy,    1984;  Hillman,    1993;   Kessner,   1973;  Kronick, 
1996;   Laffel,    1992;   Lawthers,  1993) 
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In  addition,   the  California  HIPC   (Shewry  199-6)  has 
developed  a  process  to  audit  health  plan  reporting  of 
patient  volume  for  each  of  the  3  8  conditions  in  their 
Marker  DX  model  used  to  risk  adjust  payments  to  health 
plans'   small  group  and  individual  enrollees.     Their  audit 
process  is  relatively  simple  because  it  tracks  only  the 
38  high  cost  hospital  diagnoses  in  the  Marker  DX  model 
and  most  plans  have  a  hospital  utilization  tracking 
system.     If  combined  with  ambulatory  audits  such  as 
DEMPAQ  procedures  for  assessing  ambulatory  care  using 
medical  records  or  claims,  there  may  be  a  way  to  develop 
an  audit  process  for  a  limited  set  of  specific  conditions 
(e.g.,   the  30  conditions  in  the  SPEC  I  model)  across 
provider  settings  that  is  not  too  burdensome. 

Absent  a  longitudinal  demonstration  project  to 
assess  the  type,   degree  and  impact  of  gaming  practices 
across  different  risk  assessment  models,  we  can  only 
theorize  about  the  relative  ease  of  gaming  and  auditing 
of  these  models.     Recently,  experts  in  the  field  deemed 
auditing  capability  for  self-report  specific  condition 
models  and  more  complex  claims  models   (ACGs)   to  be 
similar   (Fowles,    1994)  . 
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Administrative  feasibility  for  claims -based  models 
using  diagnosis  codes  is  comparable  across  models 
provided  software  exists  for  their  implementation 
(Fowles,   1994  1996) .     However,   claims-based  specific 
condition  models  may  have  greater  administrative 
feasibility  than  more  complex  models  for  plans  that  do 
not  have  encounter  level  claims  data  for  two  reasons. 

First,   as  in  the  California  small  group  and 
individual  HIPC,  health  plans  that  do  not  have  claims 
systems  could  be  required  to  report  the  proportion  of 
their  enrollees  with  each  condition  in  the  model  (Shewry, 
1996) .     If  health  plans  had  specific  tracer  condition 
quality  assessment  processes  and  the  tracer  conditions 
included  were  in  the  specific  condition  risk  model,  then 
health  plans  using  a  quality  assessment  mechanism  such  as 
DEMPAC     (Lawthers,   1993)   which  uses  medical  record 
abstracts  or  claims  data  may  be  able  to  determine  the 
proportion  of  enrollees  with  various  combinations  of 
coexisting  conditions  from  the  quality  assessment 
process. 

Second,   it  may  be  possible  to  translate  the  claim- 
based  specific  conditions  into  self-report  survey 
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questions  for  application  to  a  representative  sample  of 
the  population   (Gruenberg  1996)  .     If  so,   then  specific 
condition  models  would  be  more  broadly  applicable  across 
all  health  plans  and  to  new  enrollees  than  more  complex 
claims-based  models  that  can  not  be  adapted  for  self- 
report  data  collection  (e.g.  ADG/HOSDOM  and  ADG/MDC)  . 

Recent  research  reviewed  in  detail  in  Chapter  IV 
(Gruenberg,   1996;  Fowles,   1996)   suggests  self -report 
specific  condition  models  perform  as  well  as  claims-based 
models  with  respect  to  predictive  accuracy,  incentives 
for  cost  effective  care,  gaming/  auditing  and 
administrative  feasibility  when  applied  to  a  random 
sample  of  plan  enrollees   (Fowles  1996).     In  fact,  for 
plans  without  encounter  level  claims-data,   self -report 
specific  condition  models  are  recommended   (Fowles,   1996)  . 
This  recommendation  was  made  even  though  various  untested 
issues  with  respect  to  gaming/auditing  exist  for  self- 
report  risk  models  that  were  not  mentioned  above  in  the 
discussion  of  gaming  concerns  for  claims-based  models 
compared  here.     Self -report  gaming  concerns  and  their 
possible  resolution  are  now  mentioned  in  support  of 
potential  greater  administrative  feasibility  for  claims - 
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based  specific  condition  models  given  their  translation 
into  self -report  measures. 

If  MCOs  were  to  design  and  administer  surveys  to 
collect  risk  predictor  information  that  information  could 
be  biased  in  several  ways  (Fowles,  1994)  .     The  survey 
instrument  could  be  worded  in  such  a  way  as  to  elicit 
inflated  risk  responses .     The  sampling  frame  or  response 
follow-up  could  be  biased  toward  the  least  healthy 
enrollees  thereby  inflating  the  relative  health  risk  for 
that  plan.  These  types  of  bias  could  be  controlled  by  use 
of  a  standardized  health  risk  survey  administered  by  a 
third  party  survey  research  firm,  e.g.  Westat  or  Research 
Triangle  Institute   (Fowles,  1994) . 

Two  other  sources  of  self -report  bias  remain 
(Fowles,   1994)  .     Response  bias  could  result  in 
underreporting  by  enrollees  for  fear  of  repercussions 
from  the  MCO.     Provider  "coaching"  through  newsletters, 
discussions  with  health  care  providers,   etc.  could  lead 
to  "upreporting" ,  the  self -report  data  collection 
equivalent  of  "upcoding"  in  claims  data.     As  with  claims 
data,   the  latter  problem  could  be  resolved  by  annual 
recalibration  of  the  risk  model   (Ellis,  1996). 
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The  PPRC  report  recommendation  to  use  self -report 
specific  condition  models  for  health  plans  that  do  not 
have  encounter  level  claims  data  is  especially  relevant 
to  Medicare  because  Medicare  already  supports  two 
enrol lee  survey  processes  designed  and  administered  by 
third  party  survey  research  firins. 

One  survey  of  Medicare  enrollees  became  operational 
in  1991,   the  Medicare  Current  Beneficiary  Survey  (MCBS) 
(MCBS  Manual,    1991) .  The  MCBS  is  a  longitudinal  panel 
survey  of  approximately  15,0  00  beneficiaries  that 
comprise  a  representative  sample  of  all  Medicare 
enrollees.     The  MCBS  collects  and  links  respondent  claim 
data  and  survey  data  including  self-report  specific 
conditions   (e.g.   "has  your  doctor  ever  told  you  had 
diabetes" ) . 

Another  survey  is  being  developed  for  HCFA  by  the 
Research  Triangle  Institute,   the  Medicare  Beneficiary 
Health  Status  Registry  (MBHSR)    (Turner  CF,   1994)  .  The 
MBHSR  is  a  survey  of  all  new  Medicare  enrollees  enabling 
collection  of  specific  conditions  at  initial  enrollment. 
A  sample  size  of  about  40,000  is  planned  along  with  panel 
reinterviews  at  designated  intervals. 
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A  standardized  supplement  of  the  MCBS  and/or  MBHSR 
surveys  that  captures  socio-demographic  and  specific 
condition  variables  derived  from  a  claims-based  specific 
conditions  model    (e.g.,   SPEC  I)   could  be  developed  and 
administered  by  a  third  party,    (e.g.  Westat  or  Research 
Triangle  Institute) ,   to  a  representative  sample  of 
enrollees  from  plans  with  no  claims  data  and  for  new 
Medicare  enrollees. 

In  summary,   the  administrative  feasibility  of 
claims -based  specific  condition  models  is  comparable  to 
more  complex  claims -based  risk  adjustment  models  such  as 
DCGs,  ACGs  and  ADGs .     However,   administrative  feasibility 
for  specific  condition  models  may  be  greater  than  more 
complex  models  since  a  self -report  version  could  be 
applied  to  a  representative  sample  of  plan  enrollees 
where  no  claims  data  exists  in  a  manner  that  addresses 
gaming  concerns  unique  to  self -report  methods. 

The  preceding  discusses  implications  of  the 
comparison  for  all  five  models  across  the  four  evaluation 
criteria  for  risk  assessment  models.     That  discussion  is 
summarized  in  Table  6.2  below. 
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Table  6.2  Evaluation  Summary  Table 


Group  Ind . 
Accuracy  Accu: 

Rank  Rank 

SPEC  II       1  1 

ADG/MDC       2  3 

SPEC  13  2 

ADG/  4  4 

HOSDOM 

AAPCC  5  5 

PROXY 


Incent .  Ease  of  Admin . 

C/E  Gaming/Sc  Feasi- 

Care  Auditing  bility 

-Pri  Use  =Game  =CLM 

+Prd  Acc  =Audit  +S/R 

-Pri  Use  =Game  =CLM 

+Prd  Acc  =Audit  -S/R 

=Pri  Use  =Game  =CLM 

+Prd  Acc  =  Audit  +S/R 

=Pri  Use  =Game  =CLiyi 

+Prd  Acc  =Audit  -S/R 

++PriUse  +Game  =CLM 

-Prd  Acc  +Audit  +S/R 


Key:  Rank  1  highest,    5  lowest 

--  Poor,  -  Fair,  =  Good,  +  Very  Good,  ++  Excellent 
Pri  Use  =  Prior  Use,  Prd  Acc  =  Predictive  Accuracy 
Game  =  Gaming,  Audit  =  Auditing 

The  claims -based  specific  condition  models  developed 
and  evaluated  in  this  dissertation  project  are  clearly 
viable  alternatives  to  the  AAPCC  because 


1)  predictive  accuracy  exceeds  that  of  the 
ADG/HOSDOM,  ADG/MDC  and  AAPCC  proxy  comparison 
models . 


2)   Incentives  for  cost  effective  care  are  much 
higher  than  the  AAPCC  proxy  and  somewhat  higher  than 
ADG/HOSDOM  and    ADG/MDC  comparison  models. 
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3)  on  balance,  gaming  issues  for  claims-based  models 
are  comparable  to  ADG/HOSDOM  and    ADG/MDC.  However, 
specific  condition  models  may  have  an  audit 
advantage  over  more  complex  models  because  of  their 
simplicity.     In  addition,   if  conditions  in  the 
models  are  used  as  tracer  conditions  for  monitoring 
quality,  absent  fraud  or  malpractice,  quality 
assessment  processes  would  also  serve  as  specific 
condition  risk  model  audits  and  provide 
disincentives  for  gaming  the  system. 

4)  administrative  feasibility  is  comparable  to 
ADG/HOSDOM  and    ADG/MDC  comparison  models  but  may  be 
greater  than  these  models  if  specific  condition 
models  can  be  adapted  for  standardized  self-report 
administration  by  third  parties  to  new  enrollees  and 
health  plans  that  do  not  have  encounter  level  claims 
data . 


A    second  implication  of  this  research  is  that 
specific  condition  models  could  be  applied  to  other 
populations   (e.g.  Medicare  ESRD,   Medicare  Disabled, 
Medicaid,  Medicaid  Disabled,   Children  and  the  employed) . 
Three  studies  reviewed  suggest  applicability  of  specific 
condition  models  to  other  populations  such  as  individuals 
and  small  group  employees   (California  HIPC,   1995) ,  the 
Medicaid  Disabled  (Kronick,   1996)   and  the  employed 
(Fowles,   1994  1996)  . 

The  seven- step  methodology  employed  here  for  the 
Medicare  aged  could  be  replicated  to  develop  and  evaluate 
specific  condition  models  for  other  populations  in  MCOs 
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with  encounter  level  claims  data.     One  would  expect  the 
specific  conditions  included  in  a  model  and/or  their 
parameter  estimates  to  vary  by  population. 

Findings  of  the  PPRC  study  (Fowles,  1994)  suggest 
extension  of  the  specific  condition  methodology  to  the 
employed  may  be  particularly  useful.     In  that  study,  13 
self -reported  conditions  were  predictive  for  a  combined 
aged  and  employed  population.     When  the  two  populations 
were  examined  separately,   two  different  models  with  10 
conditions  in  each  model  emerged.    Although  the  majority 
of  the  original  13  conditions  appeared  in  both  models, 
some  conditions   (e.g.  Congestive  Heart  Failure)  ,appeared 
only  in  the  aged  model  and  parameter  estimates  (average 
cost)   for  condition  in  the  two  models  differed  (Fowles, 
1994)  . 

A     third  implication  of  the  research  findings  in 
this  study  is  that  specific  condition  models  may  be 
applicable  for  risk  adjusting  certain  types  of  health 
outcomes  assessment    (lezonni,   1994;  Weiner  1993)  .  As 
lezonni  notes,   the  concept  of  "risk"  is  relative  and 
inextricably  linked  to  a  specific  outcome  of  interest 
(lezonni,  1994)  . 
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For  example,  if  the  outcome  of  interest  is  hospital 
mortality  rates  with  2%  of  the  national  norm  for  the 
purpose  of  identifing  hospitals  with  higher  and  lower  than 
average  risk  of  mortality,  then  each  hospital's  mortality 
rate  must  be  adjusted  for  case  mix  in  order  to  ensure  an 
equitable  evaluation.    Hospitals  treating  the  sickest 
populations  would  have  a  higher  relative  risk  of  mortality 
so  their  rates  would  be  lowered  based  on  their  risk 
relative  to  the  norm.     Conversely,  the  mortality  rate  of 
hospitals  with  the  healthiest  populations  should  be 
adjusted  upward  to  reflect  their  adjusted  risk  of 
mortality  relative  to  the  norm. 

The  seven- step  methodology  developed  here  could  be 
applied  to  derive  a  set  of  specific  condition  predictors 
appropriate  for  risk  adjusting  comparisons  of  quality 
among  providers,  e.g.,  hospital  mortality  rates  across 
hospitals  or  adherence  to  disease  specific  guidelines. 
The  conceptual  framework  for  risk  adjustment  of  outcome 
assessments  will  vary  depending  upon  the  outcome  of 
interest  and  legitimate  sources  of  variation  in  that 
outcome  that  should  be  adjusted  for  to  ensure  equitable 
outcomes  assessment. 
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Risk  factors  included  in  an  outcomes  assessment  risk 
adjustment  model  should  be  tailored  to  serve  outcomes 
specific  objectives   (lezzoni  1994)  ,   e.g.  equitable 
comparisons  across  providers  and  quality  care.     If  the 
comparison  is  disease  or  procedure-specific,  those  risk 
factors  may  necessarily  vary  and  require  customization  of 
the  model   (i.e.,  reapplication  of  the  7  step  model 
development  process)   for  each  outcome  measure. 

A  fourth  implication  of  this  study  is  that 
generalized  software  and  a  guide  for  implementing  the 
seven  step  specific  condition  modeling  methodology 
developed  here  could  be  created.     Availability  of 
generalized  software  accompanied  by  a  guide  for 
application  throughout  the  seven  steps  outlined  on  page 
125  would  facilitate  extension  of  specific  condition  risk 
modeling  to  other  populations  and  for  outcomes 
assessment.  Feasibility  of  this  implication  is  supported 
by  availability  of  generalized  software  for  the  more 
complex  ACG   (marketed  by  CSC  for  JHU)  ,  DCG  and  HCC 
(Marketed  by  HERI)   risk  assessment  models. 
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Limitations 

Findings  from  this  research  are  limited  to  the 
Medicare  aged  and  to  capitation  risk  adjustment  for  that 
population.     Additional  research  exploring  the 
applicability  of  claims-based  specific  condition  models 
to  other  populations  and  for  other  purposes  is  needed. 
Recent  research  suggests  that  extension  of  claims -based 
specific  condition  models  to  other  populations  would  be 
fruitful   (Fowles,   1994;  Gruenberg,    1996;   Kronick,  1996; 
Shewry  1996)  .     In  addition,   findings  are  based  on  data 
from  1991  and  1992,     Replicating  this  study  using  data 
from  1996  and  1997  would  provide  more  up-to-date 
assessment  of  cost  effects  associated  with  specific 
conditions . 

Comparison  of  predictive  ratios  between  the  specific 
condition  models  and  the  ADG/MDC  or  ADG/HOSDOM  models 
are  valid  only  for  the  version  of  the  models  represented 
here.     The  ADG/MDC  and  ADG/HOSDOM  models  are  evolving. 

Comparison  of  predictive  ratios  between  the  specific 
condition  models  and  the  ADG/MDC  or  ADG/HOSDOM  models 
may  be  biased  given  that  the  operational  definition  of 
independent  variables  differ  slightly.  This  issue  is 
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distinct  from  the  gaming  concerns  discussed  above- 
concerning  the  viability  of  specific  condition  models  as 
alternatives  to  the  AAPCC.  This  issue  pertains  to  the 
difference  in  sampling  validity  of  health  status  measures 
that  could  effect  the  precision  of  the  measure  at  a 
initial  measurement  when  gaming  cannot  be  an  issue. 
Independent  variables  derived  for  specific  conditions 
include  lab  and  inpatient  admission  diagnoses.  ADG/MDC 
and  ADG/HOSDOM  predictors  do  not. 

Several  limitations  related  to  the  accuracy  and 
completeness  of  claims  data  are  listed  below.  However, 
many  of  those  limitations  decline  over  time  with 
improvements  in  claims  processing  systems    (Weiner,   1996) . 
This  list  of  claims  data  limitations  is  a  composite 
derived  from  a  portion  of  the  vast  literature  citing  such 
defects   (Anderson,   1990;  Connel,   1987;  Ellis,  1996; 
Fowles,   1994;  Hsai,    1988;   lezzoni,   199x;  Jencks  199x, 
Lave,    1994;   Mithcell,    1994;   Romano,    1992;   Weiner,  1990 
1996)  .     A  demonstration  project  would  be  necessary  to 
quantify,   describe  and  compare  the  nature  of  errors  in 
risk  assessment  models  derived  from  a  claims  data. 
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Accuracy  of  claims  data  may  be  compromised  by  coding 
errors  or  intentional  bias  through  "upcoding"  for 
purposes  of  gaming  the  system.     Changes  in  coding  schemes 
(e.g.   ICD-9-CM  or  CPT-4)   can  affect  the  meaning  of  data 
over  time  and,   therefore,   the  accuracy  or  validity  of 
health  status  measures  derived  from  diagnosis  or 
procedure  code  sets.  To  my  knowledge,  no  adjustment  to 
the  data  for  this  issue  was  made  by  HCFA  in  creating  the 
files  or  by  the  project  team  when  developing  the  analytic 
files  used  here.     Variation  in  coding  practices  among 
providers  or  administrators  may  result  in  more  or  less 
complete  coding  of  diagnosis  for  patients  with  the  same 
set  of  morbidities  which  could  also  effect  the  accuracy 
or  validity  of  health  status  measures  derived  from 
diagnosis  or  procedure  code  sets.  No  adjustment  was  made 
for  this  issue. 

Completeness  of  claims  data  can  be  compromised  by 
policy  changes  in  benefits  covered  and  coordination  of 
benefit  rules.  No  adjustment  was  made  for  this  issue. 
Data  processing  errors  may  result  in  omission  of  data  or 
in  data  duplication.     Basic  editing  of  initial  data  files 
(e.g.   reasonability  checks  of  frequency  distributions  of 
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claims  by  month,   age  and  gender)   addressed  this  problem 
in  a  rudimentary  way. 

One  final  limitation  involves  use  of  analytic  files 
created  by  other  research  teams  and  the  possibility  of 
error  outside  the  control  of  the  investigator.  However, 
given  the  expertise  of  the  two  research  teams  that 
developed  analytic  files  used  here     (Ellis,   1996;  Weiner, 
1996) ,   that  possibility  is  reduced. 
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CHAPTER  VII:  CONCLUSION 


Claims-based  simple/direct  specific  condition  risk 
assessment  models  are  viable  risk  adjusters  for 
capitation  payments  and  may  be  useful  for  risk  adjustment 
of  outcomes  assessment.     The  findings,  implications  and 
limitations  of  this  study  raise  questions  that  warrant 
additional  research. 

OPPORTUNITIES  FOR  FUTURE  RESEARCH 

Six  opportunities  for  further  research  are  presented 
below. 

First,   further  refinement  for  specific  conditions 
predictors  among  the  Medicare  aged  might  be  pursued  to 
determine  if  predictive  accuracy  could  be  maintained, 
administrative  feasibility  expanded,   ease  of  auditing 
increased  and  incentives  for  gaming  the  system  reduced. 
This  refinement  would  be  achieved  by  repeating  the  study 
with  more  recent  data   (e.g.,   1995/1996),   including  data 
for  MCOs  and  making  the  following  changes  to  the 
methodology.     A  first  step  would  be  to  omit  diagnoses 
from  laboratory  claims  and  inpatient  admission  diagnoses 
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when  constructing  the  independent  variables.     Second,  one 
might  apply  two  additional  criteria  when  defining  and 
selecting  specific  condition  predictors  in  steps  5  and  7 
of  the  model  development  process : 

1)  ease  of  use  as  a  tracer  condition  for  quality 
assessment. 

2)  ease  of  use  as  a  self -report  condition  for 
survey-based  data  collection. 

A    second  opportunity  for  future  research  is  to 
replicate  the  7  step  process  developed  here  for  other 
populations   (Medicaid  Disabled,  Medicare  ESRD,  Medicaid, 
children  and  employees)   to  develop  and  evaluate  specific 
condition  capitation  risk  adjustment  models  customized 
for  each  of  those  populations . 

A    third  opportunity  is  to  replicate  the  7  step 
process  developed  here  for  process  and  outcomes  risk 
adjustment  where  application  of  a  specific  condition  risk 
adjuster  would  be  appropriate  to  the  objective  of  that 
assessment. 

A  fourth  opportunity  for  future  research    would  be 
to  translate  the  SPEC  I  and  SPEC  II  models  developed  here 
into  self -report  survey  measures.     Standardized  Medicare 
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Current  Beneficiary  Survey  and/or  Medicare  Beneficiary 
Health  Status  Registry  plan  specific  supplements  that 
incorporate  those  self-report  measures  could  test 
viability  and  comparability  of  claims -based  and  survey- 
based  specific  condition  models. 

This  would  allow  survey  supplement  design  to  include 
conditions  known  to  be  predictive  of  future  health  cost 
from  claims-based  risk  models.     In  addition,  where  claims 
and  survey  data  exist  for  the  same  individuals   (e.g.  the 
MCBS  and  MBHSR) ,     research  could  test  whether  there  is 
any  systematic  difference  in  predictive  accuracy  for 
identical  risk  models  derived  from  claims  and  survey 
data.     That  research  might  also  examine  whether  there  is 
a  data  source  bias  and,   if  so,  to  quantify  that  bias  and 
examine  its  source    (e.g.  unmet  demand  among  enrollees 
with  a  low  propensity  to  use  the  health  care  system, 
unmet  demand  among  those  in  the  earlier  stages  of  disease 
progression,  new  enrollees  for  whom  no  prior  year  claims 
exist,   differences  in  time  lag  between  data  collection 
methods  etc . ) 

A  fifth  opportunity  for  research  is  exploring  the 
effects  of  statistical  techniques  other  than  simple 
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linear  regression  modeling  on  predictive  accuracy  of 
models .     This  research  could  test  whether  predictive 
accuracy  could  be  improved  by  using  alternate  techniques 
and  if  those  techniques  would  be  accepted  by  the 
stakeholders  interested  in  application  of  risk  adjustment 
models  to  health  plan  capitation  payments - 

As  suggested  by  other  studies    (Dowd  1996,; 
Rosencrantz  and  Luft,   1993)   two  part  modeling  could  be 
tested  for  specific  condition  models  to  determine  if  that 
statistical  approach  would  improve  predictive  accuracy. 
The  first  part  of  a  two  part  model  calculates  the 
probability  of  using  the  health  care  system  given  base 
year  enrollee  characteristics-     For  claims-based  models, 
data  on  the  year  one  population  (base  year  enrollees, 
enrollees  who  did  not  use  the  health  care  systems  and 
enrollees  who  did)   is  limited  to  socio -demographic 
information  available  from  enrollment  files. 

Part  two  of  the  process  is  applied  only  to  enrollees 
with  base  year  utilization.     This  step  would  develop  a 
specific  condition  model  using  the  methodology  applied 
here,   to  calibrate  the  effect  size  of  specific  conditions 
present  in  the  base  year  on  year  1  expenditures . 
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The  predicted  value  for  each  individual  and. risk 
class  is  then  based  on  the  probability  of  a  person  with 
those  characteristics  having  a  claim  for  a  specific 
condition  times  the  effect  size  of  the  base  year 
condition  on  year  1  expenditures-     Two  part  and  one  part 
models  could  then  be  compared  based  on  the  four  criteria 
for  evaluating  risk  models;  predictive  accuracy, 
incentives  for  cost  effectiveness,   ease  of 
gaming/auditing  and  administrative  feasibility. 

Finally,   a  sixth  opportunity  for  further  research 
would  be  a  demonstration  project  to  develop  generalized 
software  and  a  guide  to  developing  claims-based  specific 
condition  risk  models.     As  indicated  above,  existence  of 
such  software  and  documentation  would  facilitate 
development  and  evaluation  of  specific  condition  models 
across  populations  and  for  different  objectives  (i.e., 
capitation  or  outcomes  risk  adjustment)  . 


! 
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Benchmark  Tables 


Parameter  Estimate  (Cost  Weight)  Benchmark  T^/->V/i(  #vn 
This  Study  Compared  with  the  JHU/LEWIN  Project  CXjJ\Jy^ 


AAPCC  Proxy  Model 


,                                                                         COST  WEIGHTS 

Variable 

Description 

THIS  STUDY  JHU/LEWIN 

INTERCEPT 

BASE  EXPECTED  PAYMENT 

$1,903 

$1,893 

UMLE 

MALE  GENDER 

$734 

$733 

YRSOVR65 

YEARS  OF  AGE  OVER  65.  34  MAXIMUM 

$106 

$108 

EVERDISA 

EVER  RECEIVED  DISABILITY 

$1,857 

$1,895 

MEDICAID 

MEDICAID  ELIGIBLE  IN  1991 

$1,321 

$1,316 

Parameter  Estimate  (Cost  Weight)  Benchmark 
This  Study  Compared  with  the  JHU/LB/VIN  Project 

AOG/HOSDOM  Model  / 


COST  WEIGHTS^ 


vanaDie 

Description 

Tuic  CTi  inv 

1  ni9  9  1  UUT 

«inu/i.cwiN 

INTERCEPT 

BASE  EXPECTED  PAYMENT 

S424 

$434 

MALE 

MALE  GENDER 

Soio 

 $613 

YRSOVR65 

YEARS  OF  AGE  OVER  65,  34  MAXIMUM   

A 

$64 

 $64 

EVERplSA 

cVcR  RcCcI VcU  DISABIUTY 

$  1,  1  70 

McUICAID 

McDiCAlD  cLIGIBLc  IN  1991 

riaa  at  Least  i  uiagnosis  rrom  Any  encounter 

 «oU^ 

Apiouvjiviyi 

OD/^QAQI  C  U/^CDITAI  I7ATIOM  niA/^M/^CIC 

9  1 , 1 

^1  74Q 

A  Antral  n*? 

TIMF  1  IMfTPr^  MA  lOR 
.  1  IIViC  Llivll  1  CU,  IVIMJVJr> 

*0/  1 

TIMP  1  IMITPn  MA  IHR  PRIMARY  INIPPPTiOKI<5 

AADG9106 

ASTHMA 

$1,214 

$1,216 

AADG9'l07 

LIKELY  TO  RECUR,  DISCRETE 

"S362 

$365 

AADG9lb9 

'  LIKELY  TO  RECUR.  PROGRESSIVE  " 

"'$1,701  " 

$1,696 

AADG91  TV 

CHRONIC  MEDICAL.  UNSTABLE 

$1,416 

$1,415 

AADG9116 

CHRONIC  SPECIALTY.  UNSTABLE.  ORTHOPEDIC 

$597 

$593 

AADG9122  " 

INJURIES/ADVERSE  EFFECTS.  MAJOR 

$465 

$462 

AADG9123 

PSYCHIATRIC.  TIME  LIMITED,  MINOR 

$1,207 

$1,222 

AADG9125 

PSYCHIATRIC.  PERSISTENT  OR  RECURRENT,  UNSTABLE 

$1,089 

$1,088 

AADG9127 

SIGNS/SYMPTOMS.UNCERTAIN 

$572 

$568 

AADG9128"" 

SIGNS/SYMPTOMS,  MAJOR 

$757 

$753 

AADB9132 

MALIGNANCY 

$1,430 

$1,429 

Parameter  Estimate  (Cost  Weight)  Benchmark 
This  Study  Compared  with  the  JHU/LEWIN  Project 


ADG/MOC  Model 

_  ~      ~  COST  WEIGHTS  ~ 

Variabte        Description  "  "  THIS  StUDY  JHU/LEWIN 

INTERCEPT  _BASEEXPECTED  PAYMENT  7!"    '      7""^   ^S599__  _$608 

MALE  '    _    MALE  GENDER  '1  _  _S607__  ^04" 

YRSbviR65  _  S^RS  OF  AGE  OVER  65, 34  MAXIMUM    S67   '_  $57 

EVERDISA_ '  ]EyER  RECEIVED  DISABILITY  '    _  __      ._  S1.1_01   _S1.11?; 

MEDICAID     ^^MEDICAID  ELIGIBLE  IN  1991        _  .  _  __-SZ63  $761 

Had  at  Least  1  1 991  Ambulatory  Vfeft  '  _  _  L.  Z 

VADG9103    J^MEU^rreD7^l^^  _ _ "S548      "  $542 

VADG9104      tlME  UMITED.  MAJOR.  PRMARY  INFECTIONS^   $738  $734' 

VADG9106   "  ASTHMA  .  Z  '^l  _   "  "  $807  _$81 8 

VA'DG9107     _L[KEL_Y  TO  recur.  DISCRETE        "  _S221__     _  ^  $225 

VADG9109      LIKELY  tO_RECUR,  PROGRESSIVE  _  J  _  _     _  '   '  _  S964  J  ^"$965 

yADG91  i  1  '"'chronic  MEDICAL.  UNSTABLE    1  _    "Z-  "  '  '  "  _  _  _  S1_.347  _Si'345' 

VADG9116  " "  CHRdNIC" SPECIALTY.  UNSTABLE.  ORTHOPEDIC  '  _    $652  "    $650  " 

"VAD"g"9'122      INJU'RIES/a'DVERSE  EFFECTS,  major"  _  '  ""$53q;_  $525 

yADG9123  ^psychiatric'  T[ME  LIMITED.  MINOR  _  _'    '       _  '      ZZllZ  $691    __  1_I$698 

VADG9J25    _  P'SYC'HIAiTRIC,>ERSISTENT  OR  RECUR_R^^  __      J  $802   $804; 

VAD'G91'27  "' ^SIGNS/S'YMPTOMS.UNCERTAIN  '  ^   "__    _    _      _  _    $466"  _^0_ 

VAPG9128'   _SIGN's7SYMPfOMS.MAJOR'       _    "  "  _'__       "       _'   $5_56  _$551 

yApG'91_32  '_M^jGN/^^^^       _ ^  '  _     .  „  1  l$ii351  $1.347~ 

  Number  of  1991  Inpatient  Stays:  primary  diagnosis  in  MDC  

MDC9101"  "  'nervous  SYSTEM  "  '  _'_  "  51,540  $1.533 

MDC9103        EARS.NOSE.THROAT.RESPIRATORY  SYSTEMS  _  $3.254  $3.237 

MDC9105 ';__CI[RCULATORY_SYSTEM_  _  "  "  '   _    _   $1,888  $1.897 

M_DC9106__  _  DU3ESTiyE_SYSTEM_  _^  '_  "_  _"  $1.752  $1.759 

MDC91G7    '  "hEPATOBILlIaRY  SYSTEM.  PANCREAS  l  !  _$1.032  $1,030 

MDC9108  "      MUSCULOSKELETAL,  CONNECTIVE  TISSUE     "J'_"     _  $1J06"'   $1,117 

MDC9l'd9     "  SKIN.  SUBCUTANEOUS  TISSUE  AN'd"  BREAST  " "         _    "      _         11,759  $1.762 

MDC91 1 0  '  "  _ EN'DOCRJNE,  NUTRITIONAL,  "METABOlTcTSYSTEMS _  1'     '  ZZ-^Zl  S2/92_4  $2.938" 

MDC9111    '    KIDNEY.  URINARY JRACT  '  '  '        -    •  -  S2.5"59  $2.526 

MDC9118  JNFECTlbuS',  PARASITIC  DISEASES  J '  _  1  11  'S3-009;~  __  $3^661_ 
MD'C9i'19'    1  MENTAL  DISEASE,'aLCOHOL  DRUG  abuse;  "51.950___1  _  It 957_ 

Mbc9i2i_  ^Injuries, POISONINGS, "b'urns  _'  _        l  '_'l     _  IL     ${.832  1_ '$1,882 

MDC9123  "  '    HEALTH  STATUS  f'aCTO'r'S.'TRa'um'a '  "       '  '$1.505  _$"i',48i' 

MpC9_125       ;BL00D,  IM'MUNOLOGICAL.  MYELO.  DISEASES, HIV,  AIDS  _           _     $3.878     "  $3,875" 
MD'C9126  '     TRAN'SPLANTS  -        .    -  -   -  '  $3".945  $3,944 


PART  I:  A  BRIEF  DESCRIPTION  OF  THE  ACQ  METHODOLOGY 


A.  What  Are  ACGs? 

Ambulatory  Care  Groups  (ACGs)  are  a  padent-oriented  measure  of  health  status 
designed  to  be  indicative  of  expected  resource  consumption.  ACGs  can  be  used  to  adjust  for 
differences  in  the  case-mix  of  patient  populations  across  providers  or  health  plans.  ACGs 
have  many  applications  in  the  area  of  ambulatory  care  management,  evaluation,  and 
financing.  More  specifically,  these  include:  physician  profiling,  quality  assurance,  capitation 
and  rate-setting,  patient  case  management,  program  evaluation  and  developmoit,  and 
resource  planning  and  budgeting. 

A  patient  is  categorized  into  a  single  ACG  which  is  calculated  on  the  basis  of  all  the 
diagnosis  codes  they  are  assigned  by  the  ambulatory  providers  treating  them  during  a 
predetermined  period  of  time  (^ically  a  year).  This  global-^isode  orientation  makes  ACGs 
different  from  other  case-mix  measures,  such  as  Diagnosis  Related  Groups  (DRGs)  and 
Ambulatory  Patient  Groups  (APGs),  which  are  calculated  on  the  basis  of  a  single  service  or 
encounter  delivered  by  a  single  provider  at  a  single  point  in  time. 

ACGs  utilize  the  ICD-9-CM  diagnostic  codes  available  on  computerized  insurance 
claims  or  encounter  data  systems.  ACGs  can  also  be  assigned  on  the  basis  of  medical  record 
or  survey  data.  In  addition  to  ICD  codes,  a  person's  ACG  is  determined  on  the  basis  of 
their  age  and  gender. 

ACGs  have  been  described  in  some  detail  in  the  medical/health  services  literature.  A 
recendy  published  overview  article  (spearing  in  the  Journal  HMO  Practiced  is  a  good 
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i:tiat  leaders  unfuniliar  with,  this  body  of  litoatuie  review  this  aiticle. 


in  the  Reference  List  Before  pnxxeding  further  in  this  guide,  it  is 


The  Decision  Rules  Used  to  Assign  ACGs 


The  following  sub-sections  provide  a  summary  of  the  four  basic  steps  used  to  assign 
an  ACG  to  a  patient  Included  in  these  sections  are  discussions  of  three  mid-point  grouping 
methodologies  that  take  place  before  the  "end-point"  ACG  assignment  is  made.  Like  the 
DRG  assignment  process,  one  can  make  use  of  the  final  case-mix  categories  without  fully 


"open-architecture/  these  intermediate  steps  are  described  here. 

Step  1  -  Ambulatory  Diagnostic  Groups  (ADGs) 

The  first  step  entails  identifying  all  unique  ICD-9-CM  codes  attached  to  ambulatory 


claims  submitted  for  one  patient  over  the  period  of  one  year.  (Claims  data  from  a  six-month 
period  may  also  be  sufficient  for  use  in  the  ACG  system.)  Each  ICD-9-CM  code  denoting  g 
primary  diagnosis  or  subsidiary  diagnoses  is  assigned  to  one  of  34  Ambulatory  Diagnostic 
Groups  (or  ADGs).  Although  a  diagnosis  code  is  grouped  into  only  one  ADG,  many 
different  ICD-9-CM  codes  comprise  each  ADG.  Therefore,  a  patient  being  treated  for  two 
distinct  diagnoses  may  M  into  two  separate  ADGs  or  only  one  ADG,  depending  on  tfie  type 
of  diagnoses.  For  instance,  a  patient  with  both  Obstnictiye  Chronic  Bronchitis  (ICD-9-CM 
code  491.2)  and  Congestive  Heart  Failure  (ICD-9-CM  code  428.0)  will  fell  into  only  one 
ADG,  Chronic  Medical:  Unstable  (ADG  11). 


understanding  all  of  the  interim  steps.  Howeva*,  in  keeping  with  the  ACG*s  philosophy  of 
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An  cQimiioa  ICb-9<IM  codes  were  assigned  into  ah  ATO  The  initial 

a^wgnnwnt  nfTrp-O-CM:  codes  to  ADGs  was  done  by  a;  team  ofDhysicians  and  specialists 
using  utilization  daci  for  160,000  patients  from  several  test  sites.  The  aiteria  which  were 
used  as  d^enninants  oif  anticq)ated  health  care  resource  use  are: 

•  Clinical  judgments  conceming  the  likelihood  of  persistence  or 
recurrence  of  the  problem 

•  Likelihood  of  return  visits  and/or  the  need  for  continued  treatment 

•  Likelihood  of  the  need  for  specialist  services 

•  Likelihood  of  decreased  life  expectancy 

•  Likelihood  of  short-term  or  long-term  patient  disabili^ 

•  Expected  need  and  cost  of  relevant  diagnostic  and  therapeutic 
procedures  - 

•  Likelihood  of  a  required  hospitalization 

Table  1  lists  three  frequently  occurring  ICD-9-CM  codes  within  each  ADG.  For  a  more 
complete  listing  of  ICD-9-CM  codes  and  their  disposition  into  ADGs,  please  see  Appendix  2. 
(Also,  for  a  further  discussion  of  the  rationale  and  philosophy  of  the  ICD  to  ADG  assignment 
process,  please  refer  to  References  #2  and  3). 

It  should  be  noted  that  at  the  present  time  there  remain  some  infrequently  used  ICD-9- 
CM  codes  which  have  not  as  yet  been  assigned  an  ADG;  these  codes  will  be  rejected  and 
termed^  "Bad  ICDs"  when  claims  are  run  through  the  ACG  Grouper  software^  "Bad  ICDs" 

'   To  date  approximately  6,000  ICD-9  codes  have  been  assigned  to  ADGs.  These  typically 
account  for  over  97%  of  all  visits  within  a  population  group. 
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can  also  result  finom  the  'illegal*  truncation- of  four  oi  five  digiHCD-^^M  diagnosis  codes  -* 
tp  tfae  thxee  or  four  digit  leveL  Some,  but  not  all,,  three  or  four  (figit  'root'  codes  are 
commonly  used  and  accepted  for  assignment  to  an  s^ypropnate  ADG. 
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Amhulflfnry  Diagnostic  Groap 


I!CI)-^-CM;  DxagDOsfe  Code 


1      Time  LimitediMinor 


558.9     Noninfectious  Gastroenteritis  and 
Colitis 

372.0     Acute  Conjunctivitis 
380,4     Impacted  Cerumen 


Time  Limited:  Minor-Primary 
Infections 


490  Bronchitis 

462  Acute  Pharyngitis 

463  Acute  Tonsillitis 


3      Time  Limited:  Major 


789.0      Abdominal  Pain 

401.0  Malignant  Essential  Hypertension 

622. 1  Dysplasia  of  Cervix  (Uteri) 


4     Time  Limited:  Major-Primary 
Infections 


486  Pnemomia,  Organism  Unqjedfied 
682.9     Cellulitis  an  Abscess  of 

Un^jecified  Sites 
573.3     Hepatitis,  Unspecified 


5  Allergies 


477.9      Allergic  Rhinitis,  Cause 

Unspecified 
995.3      Allergy,  Unspecified,  Not 

Elsewhere  Classified 
708.9     Unspecified  Urticaria 


6  Asthma 


493.0  Extrinsic  Asthma 

493.1  Intrinsic  Asthma 
493.9     Asthma  Unspecified 


7     Likely  to  Recur:  Discrete 


381.1  Chronic  Serous  Otitis  Media 
616.1  Vaginitis  and  Vulvovaginitis 
724.5     Backache,  Unspecified 


8      Likely  to  Recur  Discrete-Infections 


9      Likely  to  Recur:  Progressive 


599.0      Urinary  Tract  Infection 
382.9     Unspecified  Otitis  Media 
034.0     Strepococcal  Sore  Throat 


434.0     Cerebral  Thrombosis 
577.0      Acute  Pancreatitis 
707.0     Deubitus  Ulcer 
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Iz^',   .<v;  ^  "A^^a  "^vlii^  ^?iar?>rf J    Table    -        * .^^^  ^  -*\Hvt-  -     xt^;t'-fi^^^v^^:?*j.<?s5<o j 

m     ■•  '                    -        -                 f            'yi^^^i---'- -      ^                    'X-    <■             .w^^J  O^V/^V'- -     -                                           ^           -         "  I 

1        Ambulatory  Diagnostic  Group 

ICD-9^-CMDiiag]iosB  Code  | 

llO      Chronic  Medical:  Stable 

401.9     Essential  Hypertension  1 
250.00    Adult-Onset  Type  Diabetes  1 
Mellitus  1 
278.0     Obesity  1 

111      Chronic  Medical:  Unstable 

282.6     Sickle-Cell  Anemia  1 
345.9     Epilepsy,  Unspecified  1 
428.0     Congestive  Heart  Failure  | 

|l2      Chronic  Specialty:  Stable-Orthopedic 

721.0     Cervical  Spondylosis  Without  1 
Myelopathy  1 
718.8     Odier  Joint  Derangement  1 
735.4     Hammer  Toe  | 

■13      Chronic  Specialty:  Stable-Ear,  Nose, 
1  Throat 

389.9     Unspecified  Hearing  Loss  1 
385.3     Cholesteatoma  1 
478.3     Paralysis  of  Vocal  Cords  or  1 
Larynx  | 

jl4      Chronic  Specialty:  Stable-Eye 

379.3     Aphakia  1 
372.9     Unspecified  Disorder  of  1 
Cojunctiva  1 
367.0     Hypermetropia  | 

■15      Chronic  Specialty:  Stable-Other 

256.4     Polycystic  Ovaries  1 
V45.0     Postsurgical  Cardiac  Pacemaker  1 
in  situ  1 
360.0     Blindness,  bilateral  1 

jl6      Chronic  Specialty:  Unstable- 
1  Orthopedic 

726.0     Adhesive  Capsulitis  of  Shoulder  1 
722.2     Displacement  of  Intervertebral  1 
Disc  1 
732.7      Osteochondritis  Dissecans  | 

|l7      Chronic  Specialty:  Unstable-Ear, 
1       "Nose,  Throat 

473.9      Chronic  Sinusitis  1 
381.5      Eustachian  Salpingitis  1 
386.0     Meniere's  Disease  | 
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AmbuMoi?' Diagnostic  Groap 


ICD-9-CM!  Diagnosis  Code 


118     Chronic  Specialty:  Unstable-Eye 


365.9     Unspecified  Glaucoma 

379.0  Sderitis 

366. 1  Senile  Cataract 


19      Chronic  Specialty:  Unstable-Other 


617.9     Endometriosis,  Site  Unspecified 
348.2     Benign  Intracxanial  Hypertension 
331.4     Obsructive  Hydrocephalus 


50  Dennatologic 


706. 1      Othe  Acne 

695.4     Lupus  Erythematosus 

078.1      Viral  Warts 


il      Injuries/ Adverse  Effects:  Minor 


920        Contusion  of  Face,  Scalp,  and 
Neck 

847.0  Neck  Sprain 

959.1  Injury  to  Trunk 


122     Injuries/ Adverse  Effects:  Major 


854.0     Intracranial  Injury 

807.0  Qosed  Fracture  of  Rib(s) 

972. 1  Poisoning  by  Unspecified  Drug  or 
Medicinal  Substance 


13      Psychosocial:  Major 


300.4  Neurotic  Depression 
311  Depressive  Disorder 
303.0     Acute  Alchoholic  Intoxication 


14     Psychosocial:  Other 


317        Mild  Mental  Retardation 
309.0     Adjustment  Reacton  with  Brief 

Depression 
799.2  Nervousness 


Psychophysiologic 


564.1     Irritable  Colon 
346.9     Migraine,  Unspedified 
625.3  Dysmenorrhea 


•6  _  Signs/Symptoms:  Minor 


784.0  Headache 

729.5     Pain  in  Limb 

787.0     Nausea  and  Vomiting 
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7     Signs/Symptoms:  Uncertain 


716.9     Unspecified  Arthropathy 
719.4     Pain  in  Joint 
785. 1  Palpitations 


Signs/Symptoms:  Major 


786.5  Chest  Pain 

780.6  Pyrexia  of  Unknown  Origin 
780.3  Convulsions 


9  Discretionary 


550.9  Inguinal  Hernia 
706.2  Sebaceous  Cyst 
455.6     Unspecified  Hemorrhoids 


0      See  and  Reassure 


611.1  -  Hypertrophy  of  Breast 
562, 1     Diverticula  of  Colon 
734        Flat  Foot 


1      Prevention/ Administrative 


V20.2     Routine  Infant  or  Child  Health 
Check 

V72.3     Gynecological  Examination 
V25.9     Unspecified  Contraceptive 
Management 


2  Malignancy 


174.9     Malignant  Neoplasm  of  Breast 
(Female) 

201.9     Hodgkin's  Disease,  Unspecified 
Type 

162.9     Malignant  Neoplasm  of  Bronchus 
and  Lung 


3  Pregnancy 


34  Dental 


V22.2     Pregnant  State 
650        Delivery  in  a  Completely  Normal 
Case 

V24.2     Routine  Poatpartum  FoUow-Up 


521.0  Dental  Caries 

523.1  Chronic  Gingivitis 

525.9     Unspecified  Disorder  of  the  Teeth 
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step  1  -  Collamed  ADGs  rCABGsV  -  -   '   

To  develop  a  reasonable  number  of  "mid-point*  gioq>s,  the  34  ADG  cat^ories  were 

collapsed  into  12  'colI^}sed  ADGs"  (CADGs).  Each  CADG  is  a  group  of  ADGs  that  is 
proved  to  be  similar  with  regard  to  the  likelihood  of  persistence/reoccunence  of  the 
diagnoses  within  them.  The  twelve  CADGs  and  the  ADGs  which  comprise  them  are  shown 
in  Table  2. 

Step  3  -  The  Maior  Ambulatory  Categories  (MACs) 

After  CADGs  were  constructed,  the  next  step  in  the  development  prcx:ess  was  to 
identify  mutually  exclusive  combinations  of  CADGs.  The  common  naturally  occurring 
combinations  of  CADGs,  of  which  there  were  23,  were  identified  and  labeled  Major 
Ambulatory  Categories  (MACs).  Two  additional  MACs  were  developed,  for  a  total  of  25 
MACs.  These  additional  two  MACs  include  one  for  the  uncommon  CADG  combinations 
and  one  for  persons  who  did  not  have  any  diagnoses  entered  during  the  period  under  review 
(i.e.  no  visits  or  no  disease) ^  Preventive/administrative  visits  (CADG  11)  may  be  present  in 
any  of  the  MACs  as  well.  Patients  are  assigned  to  one  MAC  only,  as  the  first  step  in  the 
ACG  "branching  process."  Table  3  lists  the  25  MACs  and  the  CP^DQz  which  comprise 
them. 
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Table2 

The  rMiBjMcmd  Ambulatory  Diagnosdc  Groop  (CADG)  Clusten 
and  the  Ambolatwy  Diagnostic  Groiqis  (ADGs)  that  Comprise  Them 


CblbpseiIAIK^(C::ADG) 


1      Acute  Minor 


1  Tiine  T  iim'*^'  Minor 

2  Time  f  i^tmti'  Minor-  Primary  Infections 
21  Ihjuries/Adveise  Events:  Minor 

26  Signs/Symptoms:  Minor  


2      Acute  M^or 


3  Time  t  imi*fA'  Mjy'or 

4  Time  t  imitmA-  Major-Pximary  Infections 
22  Injuries/Adverse  Events:  Major 

27  Signs/Syn^xoms:  Uncertain 

28  Signs/Symptoms:  Msgor 


3       LiJceiy  to  Recur 


5  Allergies 

7  Likely  to  Recur  Discrete 

8  Likely  to  Recur  Discrete^nfections 
20  Dennatologic 
29  Discretionaiy 


4  Asthma 


Asthma 


5       Chronic  Medical:  Unstable 


9       Ukely  to  Recur  Progressive 
1 1      Chronic  Medical:  Unstable 
32  Malignancy 


6       Chronic  Medical:  Stable 


10  Chronic  Medical  Stable:  Stable 
30      See  and  Reassure 


7       Chronic  Specialty:  Stable 


12  Chronic  Specialty^  Stable-Orthopedic 

13  Chronic  Specialty:  Stable:£ar,  Nose,  Throat 
15      Chronic  Specialty:  Stable-Other 


8  Eye/Dental 


14  Chronic  Specialty:  Stable-Eye 
34  Dental 


9       Chronic  Specialty:  Unstable 


16  Chronic  Specialty:  Unstable  Orthopedic  Chronic 

17  Specialty:  Unstable:Ear,  Nose,  Throat 

18  Chronic  Specialty:  Unstable-Other 

19  Chronic  Specialty:  Unstable-Other  


10  Psychosocial/Psychophysiologic 


23  Psychosocial:  M;yor 

24  Psychosocial:  Other 

25  Psychophysiologic 


1 1  —  Prevention/ Administrative 

12  Pregnancy 


31  Prevention/ Admimstrative 
33  Pregnancy 
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1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 

23 
24 
25 


Acute:  Minor 
Acute:  M^or 
Likely  to  Recur 
Asthma 

Girooic  Medical:  Unstable 
Chronic  Medical:  Stable 
Chronic  Specialty:  Stable 
Eye/Dental 

Chronic  Specialty:  Unstable 
Psychosocial/Psychophysiologic 
PreveatioQ/Admimstrative 
Pregnancy 

Acute:  Minor  and  Major 

Acute:  Minor  and  Likely  to  Recur 

Acute:  Minor  and  Chronic  Medical:  Stable 

Acute:  Minor  and  Eye/Dental 

Acute:  Minor  and  Psychosocial/Psycfaophys. 

Acute:  Major  and  Likely  to  Recur 

Acute:  Minor  and  Major  and  Likely  to  Recur 

Acute:  Minor  &  Likely  to  Recur  &  Eye/Dcatal 

Acute:  Minor  &  Likely  to  Recur  &  Psychosoc. 

Acute:  Minor  &  Major  &.  Likely  to  Recur  &  Chronic 
Medical:  Stable 

Acute:  Minor  &  Major  Sc.  Likely  to  Recur  <&  Psychosocial 
All  Other  Combinations  Not  Listed  Above. 
No  Visit  or  No  ADC 


Note:  CADG-1 1  (Preventive/ Administration)  ma^  also  be  present  in  MACs  1 
categorization.  See  Table  2  for  the  Detailed  listing  of  CADGs. 


-23  without  affecting  the  patient's 


5;tep  4  -  The  ACQ  "Decision  Tree' 

Finally,  based  on  a  patient's  age,  gender,  and  the  number  of  ADGs  wiiich  aze 
rq>resented  in  their  respective  MAC,  one  and  only  one  ACG  is  assigned  to  fte  patient,  based 
on  die  decision  tzee  shown  in  Hgure  1.  To  summarize,  the  ADGs  for  each  person  are 
enumerated  and  then  combined  into  CADGs,  which  then  form  the  basis  of  the  MAC  "tree 
branches"  which  ultimately  lead  to  the  end-point  ACG  categories.  Assignment  to  an  ACG 
takes  place  directly  for  18  of  the  MACs  Ci.e.  MAC  2,  4-9,  11-13.  15,  16,  18,  20-23,  25); 
for  the  other  7  MACs  (i.e.  MAC  1,  3,  10,  14,  17,  19,  24),  assignment  is  made  on  the  basis 
of  one  or  more  of  the  following:  age,  presence  of  absence  of  a  specific  ADG,  number  of 
ADGs,  and  gender.  For  instance,  a  patient  who  has  been  assigned  to  MAC  1  faces  a  three- 
way  branch  based  on  age  before  they  are  placed  into  ACGs  1,  2,  or  3.  A  person  in  MAC  3 
is  assigned  either  into  ACG  6  or  5  dependent  on  whether  or  not  they  had  at  least  one 
diagnosis  code  falling  into  ADG  5  (Allergies).  A  person  in  MAC  24  is  assigned  a  final  ACG 
based  on  their  age,  gender  and  number  of  different  ADGs.  Decisions  concerning  these 
particular  ACG  combinations  were  made  based  on  relatively  unique  use  patterns  of  the 
individuals  who  experienced  them.  (A  more  detailed  description  of  the  ACG  decision  tree  is 
available  in  Reference  #2).  A  listing  of  the  51  ACG  end-groups  is  presented  as  Table  4. 
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1 

I 


^^^^^^^^^^^ 


L  ^ 

Acute  Minor,  Age  <  1 

1  2 

Acute  Minor,  Age  2-5 

1  ^ 

Acute  Minor,  Age  6+  1 

1  ^ 

Acute:  Major  1 

1  ^ 

Likely  to  Recur,  whhout  Allergies  | 

1  ^ 

Likely  to  Recur,  with  Allergies  | 

Asthma  | 

1  ^ 

Chronic  Medical,  Unstable  | 

1  ^ 

Chronic  Medical,  Stable  | 

1  10 

Chronic  Specialty  | 

1  11 

Opthalmological/Dentai  | 

1  12 

Chronic  Specialty,  Unstable  | 

1 

Psychosocial,  without  Psychosocial  Major  | 

1 

Psychosocial,  with  Psychosocial  Major,  without  Psychosocial  Minor  | 

1 

Psychosocial,  with  Psychosocial  Major,  with  Psychosocial  Minor  | 

1  1^ 

Preventive/Administrative  | 

1 

Pregnancy  1 

18 

Acute  Minor  and  Acute  Major  | 

1 

Acute  Minor  and  Likely  to  Recur  Discrete,  Age  <  1  | 

20 

Acute  Minor  and  Likely  to  Recur  Discrete,  Age  2-5  | 

'  21 

Acute  Minor  and  Likely  to  Recur  Discrete,  Age  >  5,  Without  Allergy  | 

22 

Acute  Minor  and  Likely  to  Recur  Discrete,  Age  >  5,  With  Allergy  | 

23 

Acute  Minor  and  Chronic  Medical:  Stable  | 

24 

Acute  Minor  and  Eye/Dental  | 

25 

Acute  Minor  and  Psychosocial  Without  Psychosocial  Major  | 

16 


A 


^^^^^^^ 

1  16 

Acute  Minor  and  Psydiosodal  With  Psychosocial  M^jor,  without  Psychosocial  minor  1 

1  27 

Acute  Minor  and  Psychosocial  with  Psychosocial  Major  and  Minor  1 

1  28 

Acute  Major  and  Likely  to  Recur  Discrete  1 

1  2^ 

Acute  Minor/ Acute  Major/Likely  to  Recur  Discrete,  Age  <  2  1 

1  30 

Acute  IVCnor/ Acute  Maior/Likelv  to  Recur  Discrete.  A^e  2S  1 

1  31 

Acute  Minor/ Acute  Maior/Likelv  to  Recur  Discrete.  A^e  1 

1  ^2 

Acute  Minor/ Acute  Maior/Likelv  to  Recur  Discrete.  Ase  >  5.  Without  Allerzv  1 

1  33 

Acute  Minor/ Acute  Major/Likely  to  Recur  Discrete,  Age  >  5,  With  Allergy  1 

1  ^ 

Acute  Minor/Likely  to  Recur  Discrete/Eye  &  Dental  1 

1 

Acute  Minor/Dkdy  to  Recur  Discrete/Psychosodal  | 

1  36 

Acute  Minor/ Acute  Major/Likely  to  Recur  Discrete/Eye  &  Dental  | 

1  ^'^ 

Acute  Minor/ Acute  Major/Likely  to  Recur  Discrete/Psychosodal  | 

38 

2-3  Other  ADG  Combinations,  Age  <  17  | 

39 


2-3  Other  ADG  Combinations,  Males  Age  17-34 


40 


2-3  Other  ADG  Combinations,  Females  Age  17-34 


1 

2-3  Other  ADG  Combinations,  Age  >  34  | 

1  42 

4-5  Other  ADG  Combinations,  Age  <  17  | 

1  43 

4-5  Other  ADG  Combinations,  Age  17-44  | 

1  44 

4-5  Other  ADG  Combinations,  Age  >  44  | 

1 

6-9  Other  ADG  Combinations,  Age  <  6  | 

1  46 

6-9  Other  ADG  Combinations,  Age  6-16  | 

6-9  Other  ADG  Combinations,  Males  Age  17-34  | 

1  48 

6-9  Other  ADG  Combinations,  Females  Age  17-34  | 

1  49 

6-9  Other  ADG  Combinations,  Age  >  34  | 

1  50 

10+  Other  ADG  Combinations  | 

1 

No  Visits  and/or  No  ADGs  1 
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Standard  Analytical  File 
Inpatient 


2.  RECORD  DESCRIPTION  for  Variable/Packed  Files:^ 


Record  Name:  Inpatient 

Single  Output  [Variable  Length,  Packed,  and  Signed] 


Data  Element  Picture  Positions 

Beg  End 

01  Record  Identification  X(01)  01  01 
Code  (RIG) 

02  Qaim  Near  line  X(01)  02  02 
Record  Version  Code 

03  Beneficiary  Qaim  X(09)  03  11 
Accoimt  Niunber 

04  Category    Equatable  X(02)  12  13 
Beneficiary  Identification 

Code  (BIC) 

05  Beneficiary  X(02)  14  15 
Identification  Code 

(See  Attachment  A) 

06  Beneficiary  Residence  X(02)  16  17 
SSA  Standard  State  Code 

(See  Attachment  B) 

07  Beneficiary  Residence  X(03)  18  20 
SSA  Standard  County 

Code 

08  Beneficiary  State  X(01)  21  21 
Segment  Near  Line 

Code 


09  Beneficiary  Mailing  X(05)          22  26 
Contact  ZIP  Code 

10  Beneficiary  Sex  X(01)          27  27 
Identification  Code 


Brief  Description 


Identifies  the  record 
format  being  processed 

Identifies  record  version 
of  Near  Line  file 

Identifies  primary 
beneficiary 


Categorizes 
BICs 


groups  of 


Identifies  relationship 
between  individual  and 
primary  beneficiary 

Identifies  two  digit 
SSA  code 


Identifies  three  digit 
SSA  code 


Identifies  Near  line  file 
segment  that  contains 
beneficiary's  record  for 
specific  service  year 

Actual  code 


Identifies  sex  of 
beneficiary 


A  detailed  description  and  the  coding  scheme  for  each  data  element  are  included  in 
Section  4,  DATA  ELEMENT  DESCRIPTIONS. 
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Data  Element  Picture 
11    Benefidaiy  Race  Code  X(01) 


12    Beneficiary  Birth  Date 


13  Benefidaiy  Medicare 
Status  Code  [MSq 
(See  Attachment  C) 

14  HCFA  Claim 
Processing  Date 


15    Qaim  From  Date 


16    Qaim  Thru  Date 


17  Benefidary  Common 
Working  File  (CWF) 
Location  Code 


18    CWF  Qaim 
Accretion  Date 


19    CWF  Qaim 

Accretion  Number 


S9(09) 
COMP 

X(02) 


S9(09) 
COMP 


S9(09) 
COMP 


X(01) 


20    Claun  Disposition  Code  X(02) 


Positions 
Beg  End 

28  28 


29  32 
33  34 

35  38 


S9(09)  39  42 

COMP 


43  46 


47  47 


59(09)  48  51 

COMP 


S9(03)  52  53 

COMP-3 


21    Fiscal  Intermediary 
(FI)/Carrier 
Identification  Number 


X(05) 


54  55 


56  60 


Brief  Description 

Identifies  race  of 
beneficiary 

Date  of  beneficiary's 
buth 

Identifies  reason  for 
benefidary's  entidement 


Date  weekly  HCFA 
quality  assurance/ 
database  load  process 
cycle  begins 

First  day  of  billing 
statement  for  services 
rendered 

Last  day  of  billing 
statement  for  services 
rendered 

Identifies  CWF  location 
(host  site)  where 
benefidary's  record  is 
maintained 

Date  the  claim  is 
posted/processed  to 
master  record 

Sequence  number 
assigned  to  claim  when 
posted/processed  to 
master  record 

Identifies  outcome  of 
processing  of  claim 

Assigned  by  HCFA  to 
authorized  FI  or  carrier 
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Data  Element 

22  Fl/Camer  Qaim 
Control  Number 

23  Fl/Carrier  aaim 
Receipt  Date 

24  Fl/Carrier  Qaim 
Processing  Date 

25  Fl/Carrier  Qaim 
Payment  Date 

26  Provider  Nmnber 

27  Qaim  Query  Code 


28    Qaim  Facility 
Type  Code 


29    Qaim  Service 

Qassification  Type 
Code 


30    Claim  Frequency  Code 


31    Payment  and  Edit 
Record  Identification 
Code 


Picture  Positions 

Beg  End 

X(15)  61  75 

S9(09)  76  79 
COMP 

S9(09)  80  83 
COMP 

S9(09)  84  87 
COMP 

X(06)  88  93 

X(01)  94  94 

X(01)  95  95 

X(01)  96  96 


X(01)  97  97 

X(01)  98  98 


Brief  Description 

Unique  control  number 
assigned  by  FI  or  carrier 
to  claim 

Date  FI  or  carrier 
receives  claim 

Date  FT  or  carrier 
processes  and  releases 
claim 

Scheduled  date  of 
payment  as  shown  on 
original  claim 

Number  assigned  to 
provider  of  services 

Indicates  type  of  claim 
being  processed  based  on 
payment 

First  digit  of  type  of 
claim  code  indicating 
type  of  facility  that 
provided  beneficiary  care 

Second  digit  of  type  of 
claim  code  indicating 
classification  of  type  of 
service  provided  to 
beneficiary 

Third  digit  of  type  of 
claim  code  indicating  the 
sequence  of  claim  in 
current  episode 

Identifies  type  of  form  on 
which  claim  originated 
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Data  Element 


32    Qaim  Traiisaction 
Code 


33    Medicaid  Provider 
Identification  Number 


34  Claim  Provider 
Payment  Amount 

35  Patient  Control 
Nmnber 


36  Qaim  Medical 
Record  Nmnber 

37  Qaim  Treatment 
Authorization  Number 


38  Beneficiary  Primary 
Payer  Code 

39  Beneficiary  Primary 
Payer  Qaim  Payment 
Amount 

40  Intermediary  Qaim 
Action  Code 


41    Intermediary  Requested 
Qaim  Cancel  Reason 
Code 


Picture  Positions 
Beg  End 

X(01)  99  99 


X(12)  100  111 

S9(07)V99  112  116 
COMP-3 

X(17)  117  133 

X(17)  134  150 


X(10)  151  160 


X(01)  161  161 


S9(05)V99  162  165 
COMP-3 


X(01)  166  166 


X(01)  167  167 


Brief  Description 

Indicates  type  of  claim 
submitted  by  provider. 
Code  is  used  for  claims 
processing. 

Unique  ID  number  assigned 
to  each  provider  by  State 
Medicaid  agency 

Payment  to  provider  for 
services  covered  by  claim 

Unique  alphanumeric 
identifier  assigned  by 
provider  for  retrieving 
records 

Assigned  by  provider  to 
assist  in  record  retrieval 

Assigned  by  medical 
reviewer  to  identify 
treatment  authorization 
action 

Identifies  type  of  primary 
payer  for  claim 

Amount  of  payment  for 
beneficiary  by  primary 
payer 

Identifies  type  of  action 
taken  by  intermediary  on 
claim 

Identifies  reason 
intermediary  cancelled  a 
previous  claim 
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Data  Element 


42  aaim  Primary  Care 
Physician  Identification 
Nimiber 

43  Qaim  Status  Code 


44  Qaim  Group  Health 
Organization  (GHO) 
Paid  Switch 

45  Qaim  Admitting 
Diagnosis  Code 

46  aaim  Active  or 
Covered  Level 
Care  Thru  Date 

47  Qaim  Cost  Report 
Medicare  Covered 
Days  Count 


48    Qaim  Covered 
Days  Count 


49  Beneficiary  Inpatient 
Deductible  Amount 

50  Qaim  Medicare 
Non-Payment  Reason 
Code 

51  Qaim  Non-Covered 
Day  Count 


Picture 


X(10) 


X(02) 


X(01) 


X(05) 


X(01) 


Positions 
Beg  End 

168  177 


178  179 


180  180 


181  185 


S(09)  186  189 

COMP 


S9(03)  190  191 

COMP-3 


S9(03)  192  193 

COMP-3 


S9(03)V99  194  196 
COMP-3 


197  197 


S9(05)  198  200 

COMP-3 


Brief  Description 


State  license  number  of 
certifying  or  attending 
physician 

Identifies  status  of 
beneficiary 

Specifies  whether  GHO  has 
paid  provider  for  claim 


Indicates  beneficiary's 
initial  diagnosis  at  admission 

Date  on  claim  for  which 
covered  level  of  care  or 
active  care  ended 

Number  of  days  that  would 
have  been  Medicare  covered 
if  another  primary  payer 
were  not  involved  or  if  a 
beneficiary  had  fewer  days 
available  than  were  needed 
by  a  PPS  bill 

Nmnber  of  covered  days  of 
care  charged  to  Medicare 
facility  utilization 

Amount  paid  for  IP  services 
as  originally  submitted 

Reason  no  Medicare 
payment  is  made 


Number  of  days  not 
chargeable  to  Medicare 
facility  utilization 
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Data  Element 


52    Qaim  Pass  Through 
Per  Diem  Amomit 


53  Qaim  Principal 
Procedure  Pl^cian 
Identification  Number 

54  Qaim  Provider 
Guaranteed  Payment 
Start  Date 

55  Qaim  Admission 
Date 

56  ClaimReadmission 
Switch 


57  Beneficiary  Lifetime 
Reserve  Days  (LRD) 
Used  Coimt 

58  Beneficiary  Medicare 
Benefits  Exhausted 
Date 

59  Beneficiary  Part  A 
Coinsurance  Liability 
Amount 

60  Qaim  Source  Inpatient 
Admission  Code 


61  Beneficiary  Total 
Coinsurance  Day 
Count 


Picture 


Positions 
Beg  End 


S9(05)V99  201  204 
COMP-3 


X(10) 


S9(09) 
COMP 


S9(09) 
COMP 

X(01) 


S9(09) 
COMP 


S9(05)V99 
COMP-3 


X(01) 


205  214 

215  218 

219  222 

223  223 


S9(03)  224  225 
COMP-3 


226  229 


230  233 


234  234 


39(03)  235  236 
COMP-3 


Brief  Description 


Amount  of  reimbursable 
costs  for  current  year  divided 
estimated  Medicare  days 
for  current  year 

State  license  number  of  the 
physician  who  performed 
principal  procedure 

Date  guaranteed  payment 
to  provider  started 


Date  beneficiary  admitted 


Indicates  whether  benefidaiy 
readmitted  to  IP  facility 
within  seven  days  of  prior 
discharge 

Number  of  LRD  benefidaiy 
has  elected  to  use  during 
period  covered  by  claim 

Last  date  for  which 
beneficiary  has  Medicare 
coverage 

Amount  for  which 
intermediary  has  determined 
the  beneficiary  is  liable 

Specifies  means  (by  type  of 
admission)  by  which 
beneficiary  was  admitted  to 
IP  facility 

Coimt  of  total  number  of 
coinsurance  days  associated 
with  beneficialys  stay  in 
facihty 
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Data  Element 


Picture         Positions  Brief  Description 

Beg  End 


62  Claim  TP  Admission 
Type  Code 

63  Qaim  Utilization 
Review  (UR)  Notice 
Received  Date 


X(01)  237  237 

S9(09)  238  241 
COMP 


64  Qaim  Diagnosis  X(03)  242  244 
Related  Group 

(DRG)  Code 

65  Qaim  DRG  OutUer         X(01)  245  245 
Stay  Code 


66  Qaim  DRG  Outlier 
Approved  Payment 
Amount 

67  Qaim  Blood  Pints 
Furnished  Quantity 

68  Qaim  Blood  Pints 
Replaced  Quantity 

69  Qaim  Blood  Pints 
Not  Replaced  Quantity 


S9(07)V99  246  250 
COMP-3 


S9(03)  251  252 
COMP-3 

S9(03)  253  254 
COMP-3 

S9(03)  255  256 
COMP-3 


Qaim  Blood  Deductible 
Pints  Quantity 

Beneficiary  Blood 
Deductible  Liability 
Amount 


S9(03)  257  258 
COMP-3 

S9(05)V99  259  262 
COMP-3 


72    Claim  IP  Peer  Review       X(01)  263  263 

Organization  (PRO) 
Approval  Type  Code 


Indicates  type  and  priority 
of  IP  admission 

Date  SNF  receives  UR 
committee  finding  that 
admission  or  further  stay  no 
longer  necessary 

DRG  to  which  hospital 
daim  belongs  for  prospective 
payment 

Indicates  beneficiary  stay 
under  FPS  that  has  unusualfy 
long  length  or  exceptionally 
high  cost 

Amount  of  additional 
payment  s^roved  by  PRO 
because  of  outUer  situation 
for  a  beneficiary's  stay 

Number  of  whole  pints 
furnished  to  beneficiary 

Nmnber  of  whole  pints 
that  have  been  replaced 

Number  of  whole  pints 
that  have  not  been 
replaced 

Quantity  of  blood  pints 
appUed  to  the  claim 

Amount  for  which  the 
intermediary  determined  the 
beneficiary  is  Uable  for  the 
blood  deductible 

Specifies  type  of  approval 
or  denial  determined  by 
PRO 
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Data  Element 


73    Claim  PRO  Approval 
Grace  Day  Comit 


74  Qaim  PRO  Approval 
Service  From  Date 

75  Claim  PRO  i^proval 
Service  Thru  Date 

76  Benefidaiy  Six 
Position  Surname 
Name 

77  Beneficiary  First 
Initial  Given  Name 

78  Beneficiary  First 
Initial  Middle  Name 

79  Qaim  Total 
Prospective  Payment 
System  (PPS)  Capital 
Amount 

80  Qaim  PPS  Capital 
Hospital-Specific 
(HSP)  Amount 

81  Qaim  PPS  Capital 
Federal-Specific 
(FSP)  Amount 

82  Qaim  PPS  Capital 
Outlier  Amount 


83    Qaim  PPS  Capital 
Disproportionate 
Share  Amount 


Picture 


S9(01) 
COMP-3 


Positions 
Beg  End 

264  264 


89(09)  265  268 

COMP 

89(09)  269  272 

COMP 


X(06) 


X(01) 


X(01) 


273  278 


279  279 


280  280 


89(07)V99  281  285 
COMP-3 


S9(07)V99  286  290 
COMP-3 


89(07)V99  291  295 
COMP-3 


89(07)V99  296  300 
COMP-3 


S9(07)V99  301  305 
COMP-3 


Brief  Description 


Number  of  d^  determined 
by  PRO  as  necessary  to 
arrange  post-discharge  care 

Start  date  of  service  that  has 
been  approved  by  the  PRO 

End  date  of  service  that  has 
been  approved  by  the  PRO 

Identifies  the  first  six 
positions  of  the 
beneficiary's  surname 

Specifies  first  initial  of 
beneficiary's  given  name 

Specifies  first  initial  of 
beneficiary's  middle  name 

Total  amount  payable  for 
cs^ital  PPS  for  the  claim 


Amount  of  HSP  portion  of 
PPS  payment  for  capital 


Amount  of  FSP  portion  of 
PPS  payment  for  capital 


Amount  of  the  outlier 
portion  of  the  PPS  payment 
for  capital 

Amount  of  the 
disproportionate  share 
portion  of  the  PPS  payment 
for  capital 
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Data  Element 


84    QaimPPS  Capital 
Indirect  Medical 
Education  (IME) 
Amount 


85    Claim  PPS  Capital 
Exception  Amount 


Picture 


Positions 
Beg  End 


S9(07)V99  306  310 
COMP-3 


S9(07)V99  311  315 
COMP-3 


86  QaimPPS  Old 
Capital  Hold 
Harmless  Amoimt 

87  Qaim  PPS  Capital 
Discharge  Fraction 
Percent 

88  Qaim  PPS  Capital 
DRG  Weight  Number 


S9(07)V99  316  320 
COMP-3 


S9V9(04)  321  323 
COMP-3 


S9(03)V9(04)324  327 
COMP-3 


Brief  Description 

Amoimt  of  the  IME  portion 
of  the  PPS  payment  for 
capital 


Amount  of  capital  PPS 
exception  payments  for 
hospitals  with  high  levels  of 
capital  obUgations 

Identifies  the  hold  hamiless 
amount  payable  for  old 
capital 

Specifies  the  percent  that 
results  from  dividing  days 
by  average  length  of  stay  for 
capital  PPS  transfer  cases 

Identifies  number  used  to 
determine  a  transfer- 
adjusted  case  mix  index  for 
capital  PPS 

Specifies  nmnber  of 
claim  edit  code  trailers 
present 

Specifies  whether  current 
claim  number  differs 
from  number  in  original 
Near  Line  record 

Indicates  number  of 
diagnosis  trailers 
present 

Indicates  number  of 
procedure  code  trailers 
present 


89    Qaim  Edit  Code  Count     9(01)  328  328 


90    Qaim  Near  line  Original  9(01)  -  329  329 

Beneficiary  Qaim  Number 
Coimt 


91    Qaim  Diagnosis  Code      9(02)  330  331 

Count 


92    Qaim  Procedure  Code      9(02)  332  333 

Count 
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Data  Element  Picture 


93  Qaim  Related  9(02) 
Conditioii  Code 

Count 

94  daim  Related  9(02) 
Occurrence  Code 

Count 

95  Qaim  Occurrence  9(02) 
Span  Code  Count 


96    aaim  Value  Code  9(02) 
Count 


97    Qaim  Revenue  9(02) 
Center  Code  Count 


Positions 
Beg  End 

334  335 


336  337 


338  339 


340  341 


342  343 


Brief  Description 

Indicates  number  of 
condition  code  trailers 
present 

Indicates  number  of 
related  occurrence 
data  trailers  present 

Indicates  niunber  of 
related  occurrence 
span  data  trailers 
present 

Indicates  number  of 
value  data  trailers 
present 

Indicates  number  of 
revenue  center  data 
trailers  present 


Qaim  Edit  Group  occurs  0  to  9  times  depending  on  the  Qaim  Edit  Code  Coimt  of 
the  CWFA  Inpatient  Claim  Record  The  code  count  is  based  on  the  Qaim  Edit  Code 
from  which  index  entries  are  created  for  retrieval. 

98    Qaim  Edit  Code  X(04)  Specifies  edit  results 

that  indicate  data 
deficiencies 


Qaim  Original  Group  occurs  0  to  1  time  depending  on  the  Qaim  Near  line  Original 
Beneficiary  Qaim  Number  Count  of  the  CWFA  Inpatient  Qaim  Record, 

99  Near  Line  Original  X(09)  Original  CAN  derived  for 
Beneficiary  Qaim                                                  retrieval  purposes 
Account  Number  (CAN) 

100  Near  line  Original  BIC    X(02)  Original  BIC  associated 

with  the  original  CAN 
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Data  Element 


Picture         Positions  Brief  Description 

Beg  End 


Claim  Diagnosis  Group  occurs  0  to  99  times  depending  on  the  Qaim  Diagnosis  Code 
Count  of  the  CWFA  Inpatient  Qaim  Record.  The  code  count  is  based  on  the  Claim 
Diagnosis  Code  from  which  index  entries  are  created  for  retrieval 

101  Qaun  Diagnosis  Code       X(05)  Creates  index  entries  for 

easy  retrieval 


Qaim  Procedure  Group  occurs  0  to  99  times  depending  on  the  Qaim  Procedure  Code 
Count  of  the  CWFA  Inpatient  Qaim  Record.  The  code  count  is  based  on  the  Qaim 
Procedure  Code  from  which  index  entries  are  created  for  retrieval 

102  Qaim  Procedure  Code      X(04)  Creates  index  entries  for 

easy  retrieval 

103  Qaim  Procedure  S9(09) 
Performing  Date  COMP 


Claim  Related  Condition  Group  occurs  0  to  99  times  depending  on  the  Qaim  Related 
Condition  Code  Count  of  the  CWFA  Inpatient  Qaim  Record.  The  code  count  is 
based  on  the  Qaim  Related  Condition  Code  from  which  index  entries  are  created  for 
retrieval 


104  Qaim  Related  X(02)  Indicates  a  condition  is 

Condition  Code  is  related  to  an  institutional 

claim  that  may  affect  payer 
processing 


Qaim  Related  Occurrence  Group  occurs  0  to  99  times  depending  on  the  Qaim 
Related  Occurrence  Code  Count  of  the  CWFA  Inpatient  Qaim  Record  The  code 
count  is  based  on  the  Claim  Related  Occurrence  Code  from  which  index  entries  are 
created  for  retrieval. 

105  Qaim  Related  X(02)  Identifies  a  significant  event 

Occurrence  Code  relating  to  an  institutional 

claim  that  may  affect  payer 
processing.  (Related  to  a 
specific  date) 
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Data  Element 


106  Qaim  Related 
Occurrence  Date 


Picture 


S9(09) 
COMP 


Positions 
Beg  End 


Brief  Description 


Date  associated  with  the 
significant  event  in  data 
element  105 


Qaim  Occurrence  Span  Group  occurs  0  to  99  times  depending  on  the  Qaim 
Occurrence  Span  Code  Coimt  of  the  CWFA  Inpatient  Claim  Record.  The  code  coimt 
is  based  on  the  Qaim  Occurrence  Span  Code  from  which  index  entries  are  created  for 
retrievaL 


107  Qaim  Occurrence 
Span  Code 


X(02) 


Identifies  a  significant  event 
relating  to  an  institutional 
claim  that  may  affect  payer 
processing.  (Related  to  a 
time  period,  i.e.,  a  span  of 
dates) 


108  Qaim  Occurrence 
Span  From  Date 


S9(09) 
COMP 


From  date  associated  with 
the  significant  event  in  data 
element  107 


109  Claim  Occurrence 
Span  Thru  Date 


S9(09) 
COMP 


Thru  date  associated  with 
the  significant  event  in  data 
element  107 


Qaim  Value  Group  occurs  0  to  99  times  depending  on  the  Qaim  Value  Code  Coimt 
of  the  CWFA  Inpatient  Qaim  Record.  The  code  count  is  based  on  the  Qaim  Value 
Code  from  which  index  entries  are  created  for  retrievaL 

110  Qaun  Value  Code  X(02)  Indicates  the  value  of  a 

monetary  condition  that  was 
used  by  the  intermediary 
to  process  the  claim 


111  Qaim  Value  Amount  S9(07)V99 

COMP-3 


Amount  related  to  the 
condition  identified  in  the 
Qaim  Value  Code 
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Data  Element 


Picture        Positions  Brief  Description 

Beg  End 


Claim  Revenue  Center  Group  occurs  0  to  99  times  depending  on  the  Qaim  Revenue 
Center  Code  Count  The  code  count  is  based  on  the  Revenue  Center  Code  from 
which  index  entries  are  created  for  retrieval. 


112  Revenue  Center  Code  X(03) 


113  HCFA  Common  X(05) 
Procedure  Coding  System 
(HCPCS)  Service  Code 


114  HCPCS  Initial 
Modifier  Code 


X(02) 


Creates  index  entries  for 
easy  retrieval 

Used  by  carriers  to  pay 
SMI  claims  and  by 
intermediaries  to  indicate 
medical  procedures 

Allows  for  more  specific 
procedure  identification 
for  claim 


115  HCPCS  Second 
Modifier  Code 


X(02) 


Allows  for  second  and 
more  specific  procedure 
identification  for  claim 


116  Revenue  Center  S9(07) 
Unit  Count  COMP-3 


117  Revenue  Center  S9(07)V99 
Total  Charge  Amount  COMP-3 


118  Revenue  Center  S9(07)V99 
Non-Covered  COMP-3 
Charge  Amount 


Quantitative  measure  of 
services  provided  to  a 
beneficiary 

Total  charges  for  all 
accommodations  and 
services  for  a  billing 
period 

Benefidaiy's  charge  amount 
for  services  not  covered  by 
Medicare 
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Standard  Analytical  File 
Physician/Supplier  Part  B 


2.  RECORD  DESCRIPTION  for  Variable/Packed  FUes:* 


Record  Name:  Physidan/Supplier,  Part  B 

Single  Output  [Variable  Length,  Packed,  and  Signed] 


Data  Element  Picture  Positions 

Beg  End 

01  Record  Identification  X(01)  01  01 
Code  (RIC) 

02  Qaim  Near  line  X(01)  02  02 
Record  Version  Code 

03  Beneficiary  Claim  X(09)  03  11 
Account  Number 

04  Category    Equatable  X(02)  12  13 
Beneficiary  Identification 

Code  (BIC) 

05  Beneficiary  X(02)  14  15 
Identification  Code 

(See  Attachment  A) 

06  Beneficiary  Residence  X(02)  16  17 
SSA  Standard  State  Code 

(See  Attachment  B) 

07  Beneficiary  Residence  X(03)  18  20 
SSA  Standard  County 

Code 

08  Beneficiary  State  X(01)  21  21 
Segment  Near  line 

Code 


09  Beneficiary  Mailing  X(05)           22  26 
Contact  ZIP  Code 

10  Beneficiary  Sex  X(01)          27  27 
Identification  Code 


Brief  Description 


Identifies  the  record 
format  being  processed 

Identifies  record  version 
of  Near  line  file 

Identifies  primary 
beneficiary 

Categorizes  groups  of 
BICs 


Identifies  relationship 
between  individual  and 
primary  beneficiary 

Identifies  two  digit 
SSA  code 


Identifies  three  digit 
SSA  code 


Identifies  Near  Line  file 
segment  that  contains 
beneficiary's  record  for 
specific  service  year 

Acmal  code 


Identifies  sex  of 
beneficiary 


A  detailed  description  and  the  coding  scheme  for  each  data  element  are  included  in 
Section  4,  DATA  ELEMENT  DESCRIPTIONS. 
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T| 


Data  Element 


Picture 


11    Beneficiary  Race  Code  X(01) 


12    Beneficiary  Birth  Date 


13  Beneficiary  Medicare 
Status  Code  [MSq 
(See  Attachment  C) 

14  HCFA  Claim 
Processing  Date 


15    Qaim  From  Date 


16    Qaim  Thru  Date 


S9(09) 
COMP 

X(02) 


S9(09) 
COMP 


18  CWFQaim 
Accretion  Date 


19    CWF  Qaim 

Accretion  Number 


S9(09> 
COMP 


89(03) 
COMP-3 


20    Qaim  Disposition  Code  X(02) 


21    Fiscal  Intermediary 
(n)/Carrier 
Identification  Number 


X(05) 


Positions 
Beg  End 

28  28 


29  32 
33  34 

35  38 


89(09)  39  42 
COMP 


89(09)  43  46 
COMP 


17    Beneficiary  Common        X(01)  47  47 

Working  Ffle  (CWF) 
Location  Code 


48  51 


52  53 


54  55 


56  60 


Brief  Description 

Identifies  race  of 
beneficiary 

Date  of  beneficiary's 
birth 

Identifies  reason  for 
beneficiary's  entitlement 


Date  weekly  HCFA 
quality  assurance/ 
database  load  process 
cycle  begins 

First  day  of  billing 
statement  for  services 
rendered 

Last  day  of  billing 
statement  for  services 
rendered 

Identifies  CWF  location 
(host  site)  where 
beneficiary's  record  is 
maintained 

Date  the  claim  is 
posted/processed  to 
master  record 

Sequence  number 
assigned  to  claim  when 
posted/processed  to 
master  record 

Identifies  outcome  of 
processing  of  claim 

Assigned  by  HCFA  to 
authorized  FI  or  carrier 
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Data  Element 


22    Fl/Camer  Claim 
Control  Number 


Picture 


X(15) 


Positions 
Beg  End 

61  75 


23  Fl/Carrier  Qaim 
Receipt  Date 

24  Fl/Carrier  Claim 
Payment  Date 


S9(09)  76  79 

CONfP 

S9(09)  80  83 
COMP 


25    CWFB  Claim  Entry  X(01) 
Code 


84  84 


26    CWFB  Qaim  Payment  X(01) 
Denial  Code 


85  85 


27    CWFB  Qaim 

Referring  Physician 
Profiling  Number 


X(10) 


86  95 


28    CWFB  Qaim 

Provider  Assignment 
Indicator  Switch 


X(01) 


96  96 


29    CWFB  Qaim  Referring  X(06) 
Physician  UPIN  Number 


97  102 


30    Qaim  Edit  Code  Count     9(01)  103  103 


Brief  Description 

Unique  control  number 
assigned  by  FI  or  carrier 
to  claim 

Date  FI  or  carrier 
receives  claim 

Scheduled  date  of 
payment  as  shown  on 
original  claim 

Specifies  whether 
Part  B  claim  is  original, 
full  credit,  or 
replacement  debit 

Indicates  who  received 
payment  or  if  claim  was 
denied 

Identifies  the  number  of 
physician  who  referred 
beneficiary  to  physician 
who  performed  services 
(Assigned  by  carrier) 

Indicates  whether 
provider  accepts 
assignment  for  claim 

Unique  Physician 
Identification  Number 
(UPIN)  of  physician  who 
referred  beneficiary  to 
physician  who  performed 
services 

Specifies  number  of 
claim  edit  code  trailers 
present 
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.1  1 


Data  Element 


Picture 


31    Qaim  Near  line  Original  9(01) 
Beneficiary  Qaim  Nmnber 
Count 


32    Claim  Diagnosis  Code  9(01) 
Count 


33    Qaim  Blood  Data  Count  9(01) 


34    CWFB  Claim  Number  9(02) 
line  Item  Coimt 


Positions 
Beg  End 

104  104 


105  105 

106  106 

107  108 


Brief  Description 


Specifies  whether  current 
claim  number  differs 
from  number  in  original 
Near  line  record 

Indicates  number  of 
diagnosis  trailers  are 
present 

Nimiber  of  blood  data 
trailers  that  are  present 

Number  of  line  items 
associated  with  the  claim 


CWFB  Edit  Group  occurs  0  to  9  times  depending  on  the  Qaim  Edit  Code  Count  of 
the  CWFB  Part  B  Qaim  Record.  The  code  count  is  based  on  the  Qaim  Edit  Code 
from  which  index  entries  are  created  for  retrieval 


35    Qaim  Edit  Code 


X(04) 


Specifies  edit  results 
that  indicate  data 
deficiencies 


CWFB  Original  Group  occurs  0  to  1  time  depending  on  the  Qaim  Near  line  Original 
Beneficiary  Qaim  Number  Coimt  of  the  CWFB  Part  B  Qaim  Record. 

36  Near  Line  Original  X(09)  Original  CAN  derived  for 
Beneficiary  Qaim                                                retrieval  purposes 
Account  Number  (CAN) 

37  Near  line  Original  BIC     X(02)  Original  BIC  associated 

with  the  original  CAN 


CWFB  Qaim  Diagnosis  Group  occurs  0  to  9  times  depending  on  the  Qaim  Diagnosis 
Code  Count  of  the  CWFB  Part  B  Claim  Record.  The  code  count  is  based  on  the 
Qaim  Diagnosis  Code  from  which  index  entries  are  created  for  retrieval. 

38    Qaim  Diagnosis  Code       X(05)  Creates  index  entries  for 

easy  retrieval 
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Data  Element 


Picture       .  Positions  Brief  Description 

Beg  End 


CWFB  Blood  Group  occurs  0  to  1  time  depending  on  the  Qaim  Blood  Data  Count 
of  the  CWFB  Part  B  aaim  Record 


39  Claim  Blood  Pints  S9(03) 
Furnished  Quantity  COMP-3 

40  Qaim  Blood  Pints  S9(03) 
Replaced  Quantity  COMP-3 

41  Qaim  Blood  Pints  S9(03) 
Not  Replaced  Quantity  COMP-3 

42  Qaim  Blood  Deductible  S9(03) 
Pints  Quantity  COMP-3 


Number  of  whole  pints 
furnished  to  beneficiaiy 

Number  of  whole  pints 
that  have  been  replaced 

Number  of  whole  pints 
that  have  not  been 
replaced 

Quantity  of  blood  pints 
applied  to  the  claim 


CWFB  line  Item  Group  occurs  0  to  99  times  depending  on  the  Qaim  Nimiber  line 
Item  Count  of  the  CWFB  Part  B  Qaim  Record.  The  code  count  is  based  on  the 
Qaim  line  Item  Code. 


43    CWFB  HCFA  Provider  X(02) 
Specially  Code 


Used  for  pricing  the 
service  for  this  item  on 
the  claim 


44    CWFB  Provider  Type  X(01) 
Code 


Specifies  the  type  of 
provider  furnishing 
service  for  this  item  on 
the  claim 


45    CWFB  HCFA  Type  X(01) 
Service  Code 


Indicates  the  type  of 
service  (as  defined  in 
HCFA  Medicare  Carrier 
Manual)  for  this  item  on 
the  claim 


46    CWFB  Provider 
Participating 
Indicator  Code 


X(01) 


Indicates  whether  a 
provider  is  participating 
for  this  item  on  the  claim 


47    CWFB  Processing 
Indicator  Code 


X(01) 


Specifies  reason  an  item 
was  allowed  or  denied 
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Data  Element 


Picture         Positioiis  Brief  Description 

Beg  End 


48    CWFB  Payment 
80%/100%  Code 


X(01) 


49    CWFB  Service 

Deductible  Indicator 
Switch 


50    CWFB  Payment 
Indicator  Code 


X(01) 


X(01) 


51    CWFB  Mfle/TimeAJnits  S9(03) 
Services  Count  COMP-3 


52    CWFB  Mile/Time/Umts  X(01) 
Indicator  Code 


53  HCFA  Common 
Procedure  Coding 
System  (HCPCS) 
Service  Code 

54  HCPCS  Initial 
Modifier  Code 


X(05) 


X(02) 


Specifies  amount  that 
represents  80%  or  100% 
of  allowed  charges  less 
deductible  or  100% 
limitation  of  liability  only 

Indicates  whether  the 
service  reflected  on  the 
item  is  subject  to 
deductible 

Indicates  the  payment 
screen  used  to  determine 
the  allowed  charge  for 
the  item  on  the  claim 

Total  units  associated 
with  services  needing  imit 
reporting 

Indicates  units  associated 
with  services  needing  unit 
reporting 

Used  by  carriers  to  pay 
SMI  claims  and  by 
intermediaries  to  indicate 
medical  procedures 

Allows  for  more  specific 
procedure  identification 
for  claim 


55    HCPCS  Second 
Modifier  Code 


56  CWFB  Submitted 
Charge  Amoimt 

57  CWFB  AUowed 
Charge  Amount 


X(02) 


S9(05)V99 
COMP-3 


S9(05)V99 
COMP-3 


Allows  for  second  and 
more  specific  procedure 
identification  for  claim 

Specifies  submitted 
charges  reported  for  the 
item 

Specifies  allowed  charges 
reported  for  the  item 
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Data  Element 


Picture         Positions  Brief  Description 

Beg  End 


58    CWFB  Provider 
Tax  Number 


X(10) 


59    CWFB  Carrier  Pricing  X(02) 
Locality  Code 


SSN  or  employee  ID 
number  of 

provider/supplier  that 
identifies  who  gets 
payment 

Identifies  carrier-specific 
locality  used  for  pricing 
the  service  for  the  item 


60    CWFB  Carrier  Provider  X(02) 
Specialty  Code 


61    CWFB  Carrier  Type  X(02) 
Service  Code 


Specifies  the  carrier's 
specialty  code  for  the 
provider  that  is  used  for 
pricing 

Carrier's  type  of  service 
code  used  for  pricing  the 
item 


62    CWFB  Place  of 
Service  Code 


X(02) 


Indicates  the  place  of 
service 


63    CWFB  1st 
E]q)ense  Date 


S9(09) 
COMP 


Specifies  beginning  date 
(first  expense)  for  this 


service 


64    CWFB  Last 
Expense  Date 


S9(09) 
COMP 


Specifies  ending  date 
(last)  expense  for  this 
service 


65    CWFB  Performing 
Provider  Profiling 
Number 


66    CWFB  Service  Count 


67    CWFB  Line 
Diagnosis  Code 


X(10) 


S9(03) 
COMP-3 


X(05) 


Specifies  profiling  ID 
number  of 

physician/suppUer  who 
performed  the  service 

Identifies  total  number  of 
services  processed  for  the 
item 

Specifies  ICD-9  code  that 
indicates  the  diagnosis 
supporting  this 
procedure /service 
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Data  Element 


68    CWFB  ainical  Lab 
Number 


Picture  Positions 
Beg  End 

X(10) 


Brief  Description 


Identifies  number 
assigned  to  clinical 
laboratoiy  providing 
services 


69    CWFB  Clinical  Lab  S9(05)V99 
Charge  Amount  COMP-3 


70    CWFB  Anesthesia  S9(03) 
Base  Unit  Count  COMP-3 


71  Qaim  Provider  S9(07)V99 
Payment  Amount  COMP-3 

72  Beneficiary  Part  B  S9(03)V99 
Deductible  Liability  COMP-3 
Amount 


Specifies  fee  schedule 
charge  amount  applied 
for  services 

Specifies  base  number  of 
imits  assigned  to 
anesthesia  procedure 

Specifies  amount  paid  to 
provider  for  services  covered 

Specifies  amount  for  which 
beneficiaiy  is  liable 


73    CWFB  Psychiatric, 
Occupational  Therapy, 
Physical  Therapy  Limit 
Amount 


S9(05)V99 
COMP-3 


Specifies  amount  of  allowed 
charges  appViod  to  limit  cap 
for  this  item 


74  CWFB  Line  Blood  S9(03) 
Deductible  Pints  COMP-3 
Quantity 

75  Beneficiary  Primaiy  S9(05)V99 
Payer  Qaim  Payment  COMP-3 
Amoimt 


76  Beneficiary  Primary 
Payer  Code 

77  CWFB  Interest  Amoimt 


X(01) 


S9(05)V99 
COMP-3 


78    CWFB  Performing  X(06) 
Provider  UPIN  Number 


Specifies  quantity 
(deductible)  of  blood  pints 
for  this  item 

Specifies  amount  made  on 
behalf  of  Medicare 
beneficiaiy  by  primaiy  payer 
other  than  Medicare 

Identifies  type  of  primary 
payer  for  this  claim 

Specifies  interest  to  be  paid 
for  this  item 

Identifies  UPIN  of  physician 
who  performed  service 
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Data  Element 


Picture        Positions  Brief  Description 

Beg  End 


79    CWFB  Performing 
Provider  23P  Code 


S(09) 
COMP-3 


80    CWFB  Reduced  Payment  X(01) 
Physician  Assistant 
Code 


Identifies  ZIP  code  of 
physician/supplier  who 
performed  service 

Identifies  claims  paid  a 
reduced  fee  schedule  amount 


81  CWFB  Durable  Medical  S(09) 
Equipment  (DME) 
Coverage  Period  Start 

Date 

82  CWFB  DME  Purchase  S(05)V99 
Price  Amount 


83  CWFB  Additional  X(01) 
Claim  Documentation 
Indicator  Code 

84  CWFB  DME  Medical  S9(03) 
Necessity  Month  Count 


Specifies  the  date  the  DME 
coverage  period  started 


Specifies  the  amount  that 
represents  the  lowest  of  fee 
schedule  for  purchase  of 
DME  or  actual  charge 

S^pedfies  Aether  additional 
claim  documentation  was 
submitted 

Specifies  length  of  need  for 
DME  in  months  from  start 
date  through  the  detemiined 
period  of  need 
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